-
Type:
Improvement
-
Resolution: Unresolved
-
Priority:
Major - P3
-
None
-
Affects Version/s: None
-
Component/s: None
-
Cluster Scalability
-
None
-
None
-
None
-
None
-
None
-
None
-
None
The current coordinator logic considers includes the following step in the critical section:
- Telling donor to block writes.
- Wait for all recipients to transition to strict consistency.
- Tell all participants to commit.
- The donors-only participant drop the collection.
- The donor/recipient participants rename the collection to the targaet collection.
- All participants do some cleanup to remove/drop documents/collections that are related to resharding.
- Coordinator waits for cleanup to finish.
- Coordinator does it's own cleanup.
- Coordinator cancels the critical section timeout.
In reality, the critical section was already over the moment the recipients finished renaming the collections. We could make the participants report to the coordinator that it is "done" earlier and make the coordinator end the critical section before it starts doing it's own cleanup.
- related to
-
SERVER-111814 Investigate why resharding critical section took 10 seconds
-
- Closed
-