Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.2.6, 4.4.0-rc0, 4.7.0
Affects Version/s: 4.2.0
Component/s: Sharding
Labels:
- sharding-wfbf-day

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v4.4, v4.2, v4.0
Sprint:
Sharding 2020-03-09
Linked BF Score:
7
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Scenario:

shardKey: {x: 1}
chunks: [MinKey, 0) @ shard1, [0, MaxKey) @ shard0

1. The transaction sets its read concern timestamp to t5.
2. A migration moves the document {x: 1, y: 1} from shard0 (its last chunk) to shard1 at t10.
3. Mongos refreshes to the latest chunk metadata.
4. The transaction targets an update/delete with the predicate {y: 1}. Because the predicate does not constrain the shard key, this generates index bounds of [MinKey, MaxKey).
5. ChunkManager::getShardIdsForRange goes through every chunk that overlaps [MinKey, MaxKey) and gets the shardId that owned it at t5.
6. However, the loop has an optimization that exits early once the number of shards to be targeted equals the size of the shard version map. This causes the loop to exit early and the write to target only shard1.
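
The steps above can be reproduced with a small model. This is only a sketch in Python, not the server's C++ code: the `Chunk` class, the `history` layout, and the `early_exit` flag are hypothetical names introduced for illustration, and MinKey/MaxKey are approximated with negative and positive infinity.

```python
from dataclasses import dataclass

MIN_KEY = float("-inf")  # stand-in for MinKey (assumption)
MAX_KEY = float("inf")   # stand-in for MaxKey (assumption)


@dataclass
class Chunk:
    min_key: float
    max_key: float
    history: list  # [(valid_after_ts, shard_id)], newest entry first

    def shard_at(self, ts):
        """Return the shard that owned this chunk at timestamp ts."""
        for valid_after, shard_id in self.history:
            if valid_after <= ts:
                return shard_id
        raise ValueError("chunk has no owner at this timestamp")


def get_shard_ids_for_range(chunks, shard_versions, lo, hi, ts, early_exit=True):
    """Simplified model of the targeting loop described in the scenario."""
    shard_ids = set()
    for chunk in chunks:  # chunks are assumed sorted by min_key
        if chunk.max_key <= lo or chunk.min_key >= hi:
            continue  # chunk does not overlap [lo, hi)
        shard_ids.add(chunk.shard_at(ts))
        # The optimization: stop once every shard in the (latest) map is targeted.
        if early_exit and len(shard_ids) == len(shard_versions):
            break
    return shard_ids


chunks = [
    Chunk(MIN_KEY, 0, [(0, "shard1")]),
    Chunk(0, MAX_KEY, [(10, "shard1"), (0, "shard0")]),  # migrated at t10
]
# After the refresh, only shards that own chunks at t10 appear in the map.
shard_versions = {"shard1": "version-at-t10"}

targeted = get_shard_ids_for_range(chunks, shard_versions, MIN_KEY, MAX_KEY, ts=5)
complete = get_shard_ids_for_range(
    chunks, shard_versions, MIN_KEY, MAX_KEY, ts=5, early_exit=False
)
```

In this model, `targeted` contains only shard1 because the loop stops after the first chunk, while `complete` also contains shard0, which still owned [0, MaxKey) at t5.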

The issue here is that the shard version map only includes shards that currently own chunks, so it represents the mapping at t10 and not at t5. In the case above, 2 shards had chunks at t5, but only 1 shard had chunks at t10. Even though the document is currently on shard1, the update/delete will not see it there, because it runs under the snapshot at ts = t5, when the document was still on shard0.
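
The mismatch between the two timestamps can be made concrete by counting the shards that own chunks at each one. The snippet below is a minimal sketch under the same simplified history layout as the scenario; `shards_with_chunks_at` and `chunk_histories` are hypothetical names, not server APIs.

```python
def shards_with_chunks_at(chunk_histories, ts):
    """Return the set of shards that owned at least one chunk at timestamp ts."""
    owners = set()
    for history in chunk_histories:
        for valid_after, shard_id in history:  # newest entry first
            if valid_after <= ts:
                owners.add(shard_id)
                break
    return owners


chunk_histories = [
    [(0, "shard1")],                  # [MinKey, 0)
    [(10, "shard1"), (0, "shard0")],  # [0, MaxKey), migrated at t10
]

owners_at_t5 = shards_with_chunks_at(chunk_histories, 5)
owners_at_t10 = shards_with_chunks_at(chunk_histories, 10)
```

Here `owners_at_t5` has two shards and `owners_at_t10` has one, so an early-exit bound taken from the t10 map is too small for a read at t5. Bounding the loop by the t5 count, or skipping the early exit for snapshot reads, would avoid the problem in this model; whether the actual fix takes either approach is not stated in this ticket.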

Assignee: Randolph Tan
Reporter: Randolph Tan
Participants: Githook User, Jack Mulrow, Randolph Tan
Votes: 0
Watchers: 6

Created: Oct 04 2019 08:08:13 PM UTC
Updated: Oct 29 2023 10:16:24 PM UTC
Resolved: Mar 04 2020 05:56:40 PM UTC
