-
Type:
Task
-
Resolution: Unresolved
-
Priority:
Major - P3
-
None
-
Affects Version/s: None
-
Component/s: None
-
None
-
Cluster Scalability
-
None
-
None
-
None
-
None
-
None
-
None
-
None
As part of fixing SERVER-118706, we should add tests that inject random failures into the appliers and fetchers at key points (such as when inserting into the oplog buffer) to ensure that resharding is robust to these failures and doesn't miscount fetched/applied.
We should also refactor the code so the test and failure injection can be done in cpp tests. Some previous examples can be found here.
- is related to
-
SERVER-118706 [v8.0] Fix resharding hang where critical section never engages due to oplogEntriesFetched overcount
-
- Backlog
-