Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.3.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Catalog and Routing
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Sprint:
CAR Team 2025-11-10
CAR Domain/s:

🟩 Routing and Topology

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

During addShard, we propagate the setUserWriteBlock state from the cluster to the new shard. We also block new setUserWriteBlock coordinators from running, however we do not drain the ongoing ones. This can lead to the following scenario which leads to the new shard having incorrect userWriteBlock state:

start setUserWriteBlock coordinator
start add shard
commit add shard
setUserWriteBlock takes the stable topology region and sends the state to all shards but does not yet write it on the configsvr
add shard propagates the old setuserwriteblock state to the new shard
setUserWriteBlock writes the new value on the config server

Rather than relying on the stable topology region in the setUserWriteBlock coordinator, we should drain the ongoing coordinators in the addShard command ensuring that the state cannot change while a shard is being added.

Assignee:: Wolfee Farkas
Reporter:: Allison Easton
Participants:: Allison Easton, Githook User, Wolfee Farkas
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Oct 30 2025 08:42:51 AM UTC
Updated:: Oct 30 2025 07:04:54 PM UTC
Resolved:: Oct 30 2025 07:04:54 PM UTC

Details

Description

Attachments

Activity

People

Dates