Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.2.0-rc0
Affects Version/s: None
Component/s: None
Labels:
- resharding-success-rate-improvements

Assigned Teams:

Cluster Scalability
Backwards Compatibility:
Fully Compatible
Sprint:
ClusterScalability Apr28-May09, ClusterScalability May12-May25
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Currently, to enter the critical section, a donor needs to do a shard version refresh and process the resharding fields. The former involves doing a noop write with writeConcern "majority" with a timeout of 60 seconds. The latter involves persisting the configTime (most recent majority timestamp on the CSRS) to the config.vectorClock collection with writeConcern "majority" with a timeout of 60 seconds.

For this reason, majority replication lag on a donor can cause it to fail to transition to the critical section within the critical section timeout, or to transition too late for the recipients to finish fetching and applying oplog entries within the critical section timeout.

Please note that the state transition writes on a donor don't involve waiting for writeConcern "majority".
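
To illustrate the timing problem described above, here is a minimal back-of-the-envelope model in Python. It is not server code. It assumes that a write with writeConcern "majority" is acknowledged after roughly the current majority replication lag, that the lag is constant, and that the two steps run one after the other. The function name, the result type, and the critical section timeout parameter are hypothetical; only the two steps and their 60-second timeouts come from the description.

```python
from dataclasses import dataclass

# From the description: each majority write waits at most 60 seconds.
MAJORITY_WRITE_TIMEOUT_SECS = 60.0

# The two steps a donor performs before entering the critical section.
STEPS = (
    "shard version refresh (noop write)",
    "persist configTime to config.vectorClock",
)


@dataclass
class DonorEstimate:
    entered: bool        # whether the donor would enter the critical section in time
    elapsed_secs: float  # modelled time spent waiting on majority writes
    reason: str


def estimate_donor_entry(majority_lag_secs: float,
                         critical_section_timeout_secs: float) -> DonorEstimate:
    """Hypothetical model: each majority write takes about one lag period."""
    elapsed = 0.0
    for step in STEPS:
        # Assumption: the write is acknowledged once the majority catches up,
        # which takes approximately the current majority replication lag.
        if majority_lag_secs > MAJORITY_WRITE_TIMEOUT_SECS:
            return DonorEstimate(
                False,
                elapsed + MAJORITY_WRITE_TIMEOUT_SECS,
                f"{step} timed out waiting for majority",
            )
        elapsed += majority_lag_secs
    if elapsed > critical_section_timeout_secs:
        return DonorEstimate(False, elapsed, "exceeded critical section timeout")
    return DonorEstimate(True, elapsed, "ok")


if __name__ == "__main__":
    # The timeout value 5.0 is an illustrative input, not a documented default.
    for lag in (0.5, 4.0, 61.0):
        print(lag, estimate_donor_entry(lag, critical_section_timeout_secs=5.0))
```

Under these assumptions, even a lag well below the 60-second write timeout is enough to miss a short critical section timeout, because the donor pays the lag once per majority write.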

depends on

SERVER-104531 Support shardsvrReshardingOperationTime on donor shards

Closed

SERVER-105207 Refactor helpers for mocking ShardsvrReshardingOperationTime responses in CoordinatorCommitMonitorTest

Closed

is related to

SERVER-103932 Make ReshardingCoordinatorCommitMonitor account for replication lag on recipients

Closed

related to

SERVER-105842 Make ReshardingOplogFetcher fetch oplog entries from the primary when the recipient is approaching strict consistency to prepare for the critical section

Closed

Assignee: Cheahuychou Mao
Reporter: Cheahuychou Mao
Participants: Cheahuychou Mao, Githook User
Votes: 0
Watchers: 3

Created: Apr 24 2025 05:29:27 PM UTC
Updated: Jun 03 2025 03:19:40 PM UTC
Resolved: May 22 2025 03:47:10 PM UTC
