Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Catalog and Routing
Operating System:
ALL
CAR Domain/s:

🟩 Routing and Topology

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Similar to Similar to SERVER-117612 (ConvertToCappedCoordinator), ~~SERVER-117613~~ (CreateCollectionCoordinator), and SERVER-117615 (RefineCollectionShardKeyCoordinator), the AddShardCoordinator has a gap in its mustAlwaysMakeProgress logic which can allow the coordinator to exit without cleaning up.

The current implementation uses mustAlwaysMakeProgress from the kPrepareNewShard phase and relies on triggerCleanup in the onError to force the repeat of the cleanup in the kCheckShardPreconditions phase. If an error occurs after blocking setFCV and user writes AND cleanup fails before persisting the abort reason, the coordinator gives up without cleaning up.

is related to

SERVER-117612 ConvertToCappedCoordinator may exit without proper cleanup leaving the critical section held

Backlog

SERVER-117615 RefineCollectionShardKeyCoordinator may exit without proper cleanup leaving migrations frozen

In Code Review

SERVER-117613 CreateCollectionCoordinator may exit without proper cleanup leaving the critical section held

Closed

Assignee:: Unassigned
Reporter:: Allison Easton
Participants:: Allison Easton
Votes:: 0 Vote for this issue
Watchers:: 1 Start watching this issue

Created:: Mar 11 2026 11:11:40 AM UTC
Updated:: Mar 11 2026 11:11:48 AM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates