When multiple moveChunk commands "pile up" on a shard, only the first one actually runs the migration: it performs the range deletion and then sets the last opTime on its ReplClientInfo.
The other moveChunk commands merely join the active migration, so when they go on to wait for write concern before returning, they are waiting on an opTime that does not include the deletes from the range deletion.
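The sketch below is a simplified model of that mechanism, not the actual mongod code; the names (ReplicationLog, Client, active_move_chunk, joining_move_chunk) are hypothetical and exist only to illustrate why the joining command's write-concern wait does not cover the range deletion.

```python
# Illustrative model only -- not the mongod implementation.

class ReplicationLog:
    """Toy oplog: opTimes are just increasing integers."""
    def __init__(self):
        self.last_applied = 0
        self.majority_committed = 0

    def write(self):
        self.last_applied += 1
        return self.last_applied

    def wait_for_majority(self, op_time):
        # Stand-in for blocking until op_time is majority-replicated.
        return op_time <= self.majority_committed


class Client:
    """Per-command state; plays the role of ReplClientInfo's lastOp."""
    def __init__(self):
        self.last_op_time = 0


def active_move_chunk(client, oplog):
    # The first moveChunk does the work: it deletes the donated range and
    # then records the resulting opTime on *its own* client.
    delete_op_time = oplog.write()      # the range deletion writes
    client.last_op_time = delete_op_time


def joining_move_chunk(client, oplog):
    # A command that only joins the active migration never advances its
    # own client's lastOp, so it waits on an opTime that predates the
    # range deletion and can return before those deletes have replicated.
    return oplog.wait_for_majority(client.last_op_time)
```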
As a result, the following sequence is possible (and occurred frequently in BF-5452): a config server stepdown happens during a manual migration initiated through mongos; mongos retries the manual migration; and the second, joining migration returns before the range deletes have actually replicated.
If mongos then performs a secondary read that includes the donated range (which in v3.4 is unversioned, so it is sent to the donor shard), the read can return duplicate documents, because they have not yet been deleted on the donor's secondaries. This is true even if the moveChunk request had waitForDelete: true and writeConcern: majority, and the read had readConcern: majority.
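A hedged reproduction sketch of that scenario using PyMongo against a sharded v3.4 cluster; the namespace test.coll, the target shard name, the chunk bounds, and the use of the _waitForDelete field of the moveChunk admin command are assumptions for illustration, and the stepdown/retry timing itself is not driven here.

```python
# Reproduction sketch. Assumes: a sharded test.coll, a destination shard
# named "shard0001", and a chunk containing {_id: 0} on the donor shard.
from pymongo import MongoClient, ReadPreference
from pymongo.read_concern import ReadConcern

mongos = MongoClient("mongodb://localhost:27017")

# Manual migration through mongos. If a config stepdown causes mongos to
# retry and the retried command only joins the active migration, it may
# return before the donor's range deletes have replicated, despite the
# wait-for-delete flag and the majority write concern.
mongos.admin.command({
    "moveChunk": "test.coll",
    "find": {"_id": 0},
    "to": "shard0001",
    "_waitForDelete": True,
    "writeConcern": {"w": "majority"},
})

# Secondary read over the donated range. In v3.4 this read is unversioned,
# so it is routed to the donor shard, whose secondaries may still hold the
# not-yet-deleted documents: duplicates even with readConcern: majority.
coll = mongos.test.coll.with_options(
    read_preference=ReadPreference.SECONDARY,
    read_concern=ReadConcern("majority"),
)
docs = list(coll.find({"_id": {"$gte": 0}}))
print(len(docs))
```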
- Related to: SERVER-30183, "a moveChunk that joins the active moveChunk on a shard may not respect its waitForDelete" (Closed)