Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
- resharding-improvements

Assigned Teams:

Cluster Scalability
Operating System:
ALL
Sprint:
Cluster Scalability Priorities
Confidence Status:
None
Work Order:
3
Size Category:
TBD
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

During a reshardCollection operation, if a recipient shard steps down while processing the ShardsvrReshardRecipientCloneCommand before persisting recipient state document and awaiting the completion of majority write, the command can hang indefinitely.

This hang occurs because the command waits on a future _transitionedToCreateCollection that is not fulfilled with an error during the recipient service's mandatory cleanup on stepdown. The command's operation context provides a cancellation token, which is intended to allow interruption during shard stepDown. However, the method used to hook this cancellation – setAlwaysInterruptAtStepDownOrUp_UNSAFE is unsafe because it does not properly synchronize with RSTL, potentially preventing the command from being interrupted as expected during stepDown if it misses the state transition. As a result the reshard operation doesn't make forward progress.

I think one way to resume the progress would be to failover the config server primary what would make it retry the command.

is related to

SERVER-104258 Resharding Can Hang If Recipient Fails During ShardsvrReshardRecipientClone

Backlog

related to

SERVER-105214 Audit calls to "opCtx->setAlwaysInterruptAtStepDownOrUp_UNSAFE();"

Backlog

Assignee:: Unassigned
Reporter:: Abdul Qadeer
Participants:: Abdul Qadeer, TPM Jira Automations Bot
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Due:: 31/Jul/25
Created:: Apr 30 2025 03:45:48 AM UTC
Updated:: Jun 16 2025 09:35:28 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates