Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.3.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Cluster Scalability
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Sprint:
ClusterScalability Oct13-Oct27, ClusterScalability Nov10-Nov24, ClusterScalability Nov24-Dec8
Story Points:
5
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

~~SERVER-91109~~ introduced a way to reduce the performance impact of resharding by making the primary shard skip cloning documents and applying oplog entries if it is not a direct recipient for the collection being resharded. The design overlooked the fact that on a shard that is neither a donor nor a direct recipient, the transition to “strict-consistency” involves acquiring the critical section to prepare for collection renaming (SERVER-53653). So the performance optimization unexpectedly leads to early critical section on the primary shard, which can cause misrouted writes to get blocked long before the critical section is officially engaged by the coordinator when resharding is about to commit.

depends on

SERVER-114005 Resharding critical section timeout should cancel remaining steps on coordinator

Closed

SERVER-114004 Add command for resharding coordinator to notify recipients that critical section has started

Closed

is related to

SERVER-103554 Make ReshardingOplogFetcher fetch oplog entries from the primary during the critical section

Closed

SERVER-91109 Optimize resharding when primary shard owns zero chunks for resharded collection

Closed

SERVER-37501 Version multi-updates and multi-deletes in sharded transactions

Closed

SERVER-53653 [Resharding] Take the critical section when renaming on recipient shards

Closed

SERVER-106725 Enable featureFlagReshardingSkipCloningAndApplyingIfApplicable

Closed

related to

SERVER-109323 Disable featureFlagReshardingSkipCloningAndApplyingIfApplicable

Closed

SERVER-114854 Re-enable featureFlagReshardingSkipCloningAndApplyingIfApplicable

Open

SERVER-109962 Add liveness testing for CRUD operations during resharding

Backlog

SERVER-110368 Add server-side or test-side check that the lengths of resharding critical section on all donors and recipients are less than the length of the "blocking-writes" state on the coordinator

Backlog

SERVER-111929 Make resharding only skip cloning when it doesn't own any chunks

Closed

(2 is related to, 5 related to)

Assignee:: Cheahuychou Mao
Reporter:: Cheahuychou Mao
Participants:: Cheahuychou Mao, Githook User
Votes:: 0 Vote for this issue
Watchers:: 10 Start watching this issue

Created:: Aug 14 2025 10:02:29 PM UTC
Updated:: Dec 04 2025 05:03:18 PM UTC
Resolved:: Nov 26 2025 05:10:41 AM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates