Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 3.4.14
Affects Version/s: 3.2.14, 3.4.4
Component/s: Sharding
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Sprint:
Sharding 2018-02-26, Sharding 2018-03-12
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The range deleter waits for replication on two occasions:

First using the moveChunk operation's write concern in Helpers::removeRange which does log the time spent for replication.
Second time using a 'majority' write concern, which does not log at all.

This second majority wait is completely unnecessary. The migration recipient side can keep going without attempting a majority write until the very end, after all documents have been transferred.

As part of fixing this bug, we should consider the following:

Before even accepting a migration request, the recipient shard should do a best-effort attempt to check how behind it is from the rest of the replica set (perhaps by doing a majority write with some timeout then) and if that fails, don't even attempt a migration. This is the counterpart of ~~SERVER-22876~~.
If the migration was for an empty chunk and we didn't patch up any indexes, do not do any replication waits at all and enter the READY state immediately.

related to

SERVER-29807 RangeDeleter should log when its about to wait for majority replication

Closed

Assignee:: Kevin Pulo
Reporter:: Kaloian Manassiev
Participants:: Dianna Hohensee, Githook User, Kaloian Manassiev, Kevin Pulo
Votes:: 0 Vote for this issue
Watchers:: 13 Start watching this issue

Created:: Jun 23 2017 12:05:24 PM UTC
Updated:: Oct 30 2023 11:15:46 PM UTC
Resolved:: Feb 28 2018 11:57:21 AM UTC
Confidence Status Last Update:: 21/Feb/18 12:27 AM

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates