PeriodicKillSecondaries is not compatible with CleanEveryN hook, causing the server to hang

XMLWordPrintableJSON

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • None
    • Replication
    • ALL
    • Repl 2025-06-09, Repl 2025-06-23, Repl 2025-07-07, Repl 2025-07-21, Repl 2025-08-04
    • 200
    • None
    • 3
    • TBD
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • 0

      CleanEveryN hook is typically used in various suites to clean up the data each N test runs, to avoid garbage pilling up in the system.

      Once we introduced CleanEveryN hook as part of change_streams_pre_images_replica_sets_kill_secondary_jscore_passthrough we got consistent hangs in CleanEveryN runs, each failing with the timeout.

      Based on the hang analyzer stacktrace, it appears that we are hanging onĀ 
      ReplicationCoordinatorExternalStateImpl::_stopDataReplication()
      Job logs (ref)

      Hang analyzer stacktrace (ref)

              Assignee:
              Sean Zimmerman
              Reporter:
              Denis Grebennicov
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated: