Add timeout to RSTLKillOpThread during step down


    • Type: Task
    • Resolution: Unresolved
    • Priority: Major - P3
    • Affects Version/s: None
    • Component/s: None
    • Replication
    • Repl 2026-02-02

      SERVER-113421 and the associated HELP ticket found that the RSTLKillOp thread can get stuck for a long time (2+ hours in the HELP case) because it is unable to check out the session. SERVER-113421 still requires further investigation to determine a root cause, but in the meantime we decided to make the error handling more specific: add a timeout to the RSTLKillOp thread and, if the timeout expires, fail the stepdown loudly with enough observability to diagnose this case if it ever happens again.
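
      As a rough illustration of the intended behavior (not the server's actual code), the sketch below bounds a session checkout with a timeout and reports failure loudly instead of waiting indefinitely. `Session`, `tryCheckOut`, `checkIn`, and `runKillOpPass` are hypothetical names invented for this example, and the real implementation would use the server's own session catalog, logging, and error codes.

```cpp
#include <chrono>
#include <condition_variable>
#include <iostream>
#include <mutex>

// Hypothetical stand-in for a session that another operation may hold.
struct Session {
    std::mutex m;
    std::condition_variable cv;
    bool checkedOut = false;
};

// Wait up to `timeout` for the session to become free, then claim it.
// Returns false if the session is still checked out when the timeout expires.
bool tryCheckOut(Session& s, std::chrono::milliseconds timeout) {
    std::unique_lock<std::mutex> lk(s.m);
    if (!s.cv.wait_for(lk, timeout, [&] { return !s.checkedOut; })) {
        return false;
    }
    s.checkedOut = true;
    return true;
}

// Release the session and wake one waiter.
void checkIn(Session& s) {
    {
        std::lock_guard<std::mutex> lk(s.m);
        s.checkedOut = false;
    }
    s.cv.notify_one();
}

// One pass of a kill-op style worker. Returns true if the pass completed,
// false if the checkout timed out and the caller should fail the step down.
bool runKillOpPass(Session& s, std::chrono::milliseconds timeout) {
    if (!tryCheckOut(s, timeout)) {
        // Assumption: a real implementation would log structured diagnostics here.
        std::cerr << "step down failed: session checkout timed out after "
                  << timeout.count() << " ms\n";
        return false;
    }
    // ... kill conflicting operations while holding the session ...
    checkIn(s);
    return true;
}
```

      The point of the design is that a stuck checkout becomes a bounded, visible failure: the caller gets a clear signal to abort the stepdown and the log records how long the thread waited.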

            Assignee:
            Ruchitha Rajaghatta
            Reporter:
            Ruchitha Rajaghatta
            Votes:
            0
            Watchers:
            2

              Created:
              Updated: