Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Done
Priority: Major - P3
Fix Version/s: 3.4.4, 3.5.6
Affects Version/s: None
Component/s: Replication
Labels:
- bkp

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v3.4
Sprint:
Repl 2017-04-17
Linked BF Score:
0
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

There is a race where the sync source's most recent oplog entries could become visible and be read by downstream nodes before the sync source updates its heartbeat map with its new last applied OpTime. This can cause downstream nodes to get stale lastAppliedOpTimes in metadata which can be a problem for OplogFetcher::checkRemoteOplogStart.

This will only cause the OplogFetcher to return early and choose a new sync source, so it should not cause harm beyond unnecessary sync source changes and some very quick initial sync restarts.

We should remove the check that the remote last applied OpTime is greater than or equal to the local last fetched OpTime in OplogFetcher::checkRemoteOplogStart when "requireFresherSyncSource" is false. This will also require changing the comments that explain the boolean's meaning.

An alternative is to use the max of the metadata lastOpApplied and the last OpTime in the batch as the remote last applied OpTime in OplogFetcher::checkRemoteOplogStart.

is related to

SERVER-27403 Consider term and rbid when validating the proposed sync source

Closed

Assignee:: Matthew Russotto (Inactive)
Reporter:: Judah Schvimer
Participants:: Githook User, Judah Schvimer, Matthew Russotto
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Mar 17 2017 03:51:33 PM UTC
Updated:: Sep 07 2017 05:07:21 AM UTC
Resolved:: Apr 03 2017 11:57:19 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates