- Type: Bug
- Resolution: Fixed
- Priority: Major - P3
- Affects Version/s: None
- Component/s: Testing Infrastructure
- Fully Compatible
- v4.2, v4.0
- STM 2019-07-01
- 18
- 3
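Before running workload teardowns, the fsm runner's main thread removes the "stepdown permitted file" and then checks that the "stepping down file" is not present.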
But the continuous stepdown thread does the following:
- checks for the "stepdown permitted file"
- on starting a stepdown round, writes the "stepping down file"
- on completing the stepdown round, removes the "stepping down file."
This allows the following interleaving:
- continuous stepdown thread checks for "stepdown permitted file" and sees it
- fsm runner thread removes "stepdown permitted file"
- fsm runner thread checks for "stepping down file" and doesn't see it
- fsm runner thread starts executing a workload's teardown
- continuous stepdown thread starts a stepdown round, which can cause the workload's teardown thread to get a network error
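
The interleaving above is a check-then-act race, and it can be reproduced with a small model. The sketch below is illustrative only: it is not resmoke or fsm runner code, an in-memory set stands in for the two marker files, and every name in it (`runner`, `racy_stepdown`, `safer_stepdown`, `overlapping_schedules`) is hypothetical. It enumerates every ordering of the two threads' steps and reports the orderings in which a teardown and a stepdown round both start. The `safer_stepdown` variant shows one possible way to close the window (write the "stepping down file" first, then re-check the "stepdown permitted file"); it is an assumption for illustration, not necessarily the fix applied for this ticket.

```python
from itertools import permutations


def runner(state):
    """fsm runner main thread, modelled as two atomic steps."""
    state["files"].discard("permitted")          # step 1: remove permitted file
    yield
    if "stepping_down" not in state["files"]:    # step 2: check stepping down file
        state["events"].append("teardown")       # file absent: teardown starts
    yield


def racy_stepdown(state):
    """Stepdown thread as described in the ticket: check first, act later."""
    saw_permitted = "permitted" in state["files"]  # step 1: check permitted file
    yield
    if saw_permitted:                              # step 2: act on a stale check
        state["files"].add("stepping_down")
        state["events"].append("stepdown")
    yield


def safer_stepdown(state):
    """Hypothetical alternative: announce first, then re-check permission."""
    state["files"].add("stepping_down")          # step 1: write stepping down file
    yield
    if "permitted" in state["files"]:            # step 2: re-check permitted file
        state["events"].append("stepdown")
    else:
        state["files"].discard("stepping_down")  # not permitted: back off
    yield


def overlapping_schedules(stepdown_thread):
    """Return every schedule in which both a teardown and a stepdown start."""
    bad = []
    # "R" = one runner step, "S" = one stepdown step; two steps per thread.
    for schedule in sorted(set(permutations("RRSS"))):
        state = {"files": {"permitted"}, "events": []}
        threads = {"R": runner(state), "S": stepdown_thread(state)}
        for who in schedule:
            next(threads[who])
        if {"teardown", "stepdown"} <= set(state["events"]):
            bad.append("".join(schedule))
    return bad


print(overlapping_schedules(racy_stepdown))
print(overlapping_schedules(safer_stepdown))
```

In this model the only unsafe ordering for the racy protocol is the one listed in the ticket: the stepdown thread checks, the runner removes and checks, and the stepdown thread then starts its round. When the runner sees the "stepping down file", the model simply does not start the teardown; a real runner would wait and check again.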
causes:
- SERVER-42195 Stepdown suites fail with Python exception when run with --repeat >1 (Closed)
- SERVER-72957 stepdown suites logs are polluted with non relevant error messages (Closed)

is depended on by:
- SERVER-39993 Add kill and terminate versions of concurrency step down suites (Closed)

is related to:
- SERVER-39770 FSM connection cache setup can fail with step down (Closed)
- SERVER-34555 Migrate concurrency_sharded_with_stepdowns{,_and_balancer}.yml test suites to run directly via resmoke.py (Closed)