Type: Bug
Resolution: Gone away
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Sharding
Labels: None
Operating System: ALL
Sprint: Sharding 2020-09-21, Sharding 2020-10-05, Sharding 2020-10-19
Linked BF Score: 13
Confidence Status: None
Work Order: 3
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

After addShard, we call ShardRegistry::reload() on the router so that the next request from the same client can target the new shard without receiving a ShardNotFound error. However, with the streamable replica set monitor, calls to onPossibleSet can concurrently overwrite the host data in the ShardRegistry after that reload, so a subsequent request can still fail with ShardNotFound. There does not seem to have ever been a guarantee that shard add/remove operations are causally consistent with CRUD ops in any meaningful way, but this breaks tests that relied on the shard being available after addShard.
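The interleaving described above can be illustrated with a toy model. This is only a sketch of one way an unordered overwrite could lose a newly added shard; it is not the server's code, and the actual mechanism inside ShardRegistry and the replica set monitor may differ. The names `ToyShardRegistry`, `snapshot`, `install`, and `run_interleaving`, along with the shard ids and host strings, are hypothetical and exist only for this illustration.

```python
class ShardNotFound(Exception):
    """Raised when a shard id is missing from the toy registry."""


class ToyShardRegistry:
    """Hypothetical stand-in for a router's registry: shard id -> host list."""

    def __init__(self):
        self._shards = {}

    def reload(self, config_shards):
        # Replace the whole view with what the config server reports.
        self._shards = dict(config_shards)

    def snapshot(self):
        # Copy of the current view, as a monitor callback might take one.
        return dict(self._shards)

    def install(self, shards):
        # Unconditional overwrite: nothing orders this against reload().
        self._shards = dict(shards)

    def get_shard(self, shard_id):
        try:
            return self._shards[shard_id]
        except KeyError:
            raise ShardNotFound(shard_id) from None


def run_interleaving():
    """Replay the racy steps sequentially so the outcome is deterministic."""
    config = {"shard0": ["a:27017"]}
    registry = ToyShardRegistry()
    registry.reload(config)

    # 1. A monitor callback begins: it copies the view to update shard0's hosts.
    stale = registry.snapshot()
    stale["shard0"] = ["a:27017", "b:27017"]

    # 2. addShard completes and the router reloads; shard1 is now visible.
    config["shard1"] = ["c:27017"]
    registry.reload(config)
    visible_after_reload = "shard1" in registry.snapshot()

    # 3. The callback finishes and installs its stale view, dropping shard1.
    registry.install(stale)

    # 4. The next request from the same client targets the new shard.
    try:
        registry.get_shard("shard1")
        outcome = "found"
    except ShardNotFound:
        outcome = "ShardNotFound"
    return visible_after_reload, outcome
```

In this model the new shard is visible immediately after the reload and missing again once the stale view is installed, which matches the symptom reported here. Ordering updates by a version or timestamp, so that an older view can never replace a newer one, is one general way to rule out this kind of lost update.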

is related to
SERVER-35252 All config server metadata commands that read from ShardRegistry might read stale data (Backlog)
SERVER-46202 Implement ShardRegistry on top of ReadThroughCache to make it causally consistent (Closed)

related to
SERVER-48996 Race between isMaster response connection hook and RSM topology change triggers ShardNotFound (Closed)

Assignee: Lamont Nelson
Reporter: Matthew Saltz (Inactive)
Participants: Kaloian Manassiev, Lamont Nelson, Matthew Saltz
Votes: 0
Watchers: 7

Created: May 13 2020 09:26:47 PM UTC
Updated: Oct 27 2023 08:42:14 PM UTC
Resolved: Oct 09 2020 05:26:47 AM UTC
Confidence Status Last Update: 08/Jun/20 3:22 PM
