[SERVER-70585] ContinuousStepdown can run indefinitely if mongod crashes while stepping down Created: 14/Oct/22  Updated: 29/Oct/23  Resolved: 07/Nov/22

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: 6.2.0-rc0

Type: Task Priority: Major - P3
Reporter: Vishnu Kaushik Assignee: Vishnu Kaushik
Resolution: Fixed Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Backports
Related
related to SERVER-48176 Process invariant should fail the res... Backlog
Backwards Compatibility: Fully Compatible
Backport Requested:
v6.0, v5.0, v4.4, v4.2
Sprint: Repl 2022-10-31, Repl 2022-11-14
Participants:
Linked BF Score: 120

 Description   

If mongod crashes while stepping down, this while True loop will run forever. This means that eventually Evergreen will have to kill the process and that can make it harder to diagnose the problem since we'll have a lot more logs / the logs might be missing because the logs grew too large.

We should change that loop so that it retries for a fixed amount of time, similar to what is being done for step up by the hook here.



 Comments   
Comment by Githook User [ 07/Nov/22 ]

Author:

{'name': 'kauboy26', 'email': 'vishnu.kaushik@mongodb.com', 'username': 'kauboy26'}

Message: SERVER-70585 ContinuousStepdown can run indefinitely if mongod crashes while stepping down
Branch: master
https://github.com/mongodb/mongo/commit/09aeac0a7d1f187737282bd2c2b0874341cb7412

Generated at Thu Feb 08 06:16:33 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.