Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.3.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Cluster Scalability
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Steps To Reproduce:

See the changes here in the flush_resharding_state_change_command.cpp and the concurrency_sharded_replication_with_balancer_and_config_transitions.yml files.

Sprint:
ClusterScalability Jul21-Aug3
Linked BF Score:
200
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

If FlushReshardingStateChangeCmd fails due to a write concern timeout and never refreshes the resharding state, it logs the failure but does not return a status indicating that it failed.

Because no error is reported, the resharding coordinator cannot retry, and resharding hangs: if this command is called during cloning, the resharding participants cannot make progress and the recipients are never established.

See this patch build for a reproducer and logs that show this failure mode.

The easiest fix is to have this command return a status instead of being void.
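
As a rough illustration of the proposed fix, the sketch below contrasts a void command that only logs a failure with one that returns a status the caller can retry on. It is not the server code: `Status`, `RefreshFn`, `flushVoid`, `flushWithStatus`, and `flushWithRetry` are hypothetical names invented for this example, and the refresh step is modeled as a plain callable.

```cpp
#include <cassert>
#include <functional>
#include <iostream>
#include <string>
#include <utility>

// Hypothetical stand-in for a status type; not the server's actual Status class.
struct Status {
    bool ok;
    std::string reason;
    static Status OK() { return {true, ""}; }
    static Status Error(std::string r) { return {false, std::move(r)}; }
};

// Hypothetical model of "refresh the resharding state"; may fail transiently.
using RefreshFn = std::function<Status()>;

// Current shape (assumed): the failure is logged, but the caller learns nothing.
void flushVoid(const RefreshFn& refresh) {
    Status s = refresh();
    if (!s.ok) {
        std::cerr << "flush failed: " << s.reason << '\n';
    }
}

// Proposed shape: the failure is logged and also propagated to the caller.
Status flushWithStatus(const RefreshFn& refresh) {
    Status s = refresh();
    if (!s.ok) {
        std::cerr << "flush failed: " << s.reason << '\n';
    }
    return s;
}

// Caller-side retry loop, which is only possible once a status is returned.
Status flushWithRetry(const RefreshFn& refresh, int maxAttempts) {
    assert(maxAttempts > 0);
    Status last = Status::Error("not attempted");
    for (int attempt = 1; attempt <= maxAttempts; ++attempt) {
        last = flushWithStatus(refresh);
        if (last.ok) {
            return last;  // Stop retrying as soon as the refresh succeeds.
        }
    }
    return last;  // Every attempt failed; report the last error.
}
```

With `flushVoid`, a caller cannot distinguish a timed-out refresh from a successful one, which matches the hang described above. With `flushWithStatus`, the caller can decide whether to retry or abort.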

is related to

SERVER-58081 _flushReshardingStateChange from coordinator races with donor shard acquiring critical section, stalling the resharding operation

Closed

related to

SERVER-111039 Investigate transient errors that can be thrown by the catalog cache

Backlog

SERVER-104317 Update WithAutomaticRetry to retry on WCEs

Closed

Assignee: Cheahuychou Mao
Reporter: Ben Gawel (Inactive)
Participants: Ben Gawel, Cheahuychou Mao, Githook User
Votes: 0
Watchers: 5

Created: Jul 22 2025 07:55:24 PM UTC
Updated: Sep 17 2025 10:04:52 AM UTC
Resolved: Jul 28 2025 08:04:09 PM UTC