Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 3.6.0-rc5
Affects Version/s: None
Component/s: Replication
Labels:
- bkp

Backwards Compatibility:
Fully Compatible
Backport Requested:

v3.6
Sprint:
Repl 2017-11-13, Repl 2017-12-04
Linked BF Score:
0
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The list of potential stable timestamp candidates is maintained in memory, in
ReplicationCoordinatorImpl::_stableTimestampCandidates. It will need to be cleared after recovering to a timestamp or when starting a new initial sync. During rollback, after recovering to the stable timestamp, the replication system will apply oplog entries between the stable timestamp and the common point. Before this oplog application process, we should clear the list of all timestamps except the stable timestamp. The oplog entries between the stable timestamp and the common point may get applied differently than they were originally, so we need to clear the timestamp list before doing this to avoid leaving timestamp candidates in the list that might no longer fall at a consistent point.

Note on the 'rollbackViaRefetch' algorithm in 3.6:

For 3.6, where we still use the "rollbackBackViaRefetch" algorithm, we will need to do the following to make sure we are never setting the stable timestamp to a timestamp at an inconsistent state:

Upon entering ROLLBACK, set a flag, dataConsistent=false
Upon reaching minValid, in tryToGoLiveAsASecondary in sync_tail.cpp, set dataConsistent=true, since reaching the minValid optime implies that the database state is now consistent.
We should never add an optime to the set of stable optime candidates in ReplicationCoordinator if dataConsistent=false
Upon leaving ROLLBACK, clear the list of stable optime candidates that are past the current stable optime.

is related to

SERVER-32041 amend stableCandidates logic from replcoord

Closed

related to

SERVER-32185 Freshly synced secondaries respond to queries before their "sync time"

Closed

SERVER-29891 Roll Back to Checkpoint: Call setStableTimestamp() when commit point or last applied changes

Closed

SERVER-47844 Update _setStableTimestampForStorage to set the stable timestamp without using the stable optime candidates set when EMRC=true

Closed

Assignee:: Will Schultz
Reporter:: Will Schultz
Participants:: Eric Milkie, Githook User, Spencer Brody, Will Schultz
Votes:: 0 Vote for this issue
Watchers:: 7 Start watching this issue

Created:: Aug 09 2017 06:44:44 PM UTC
Updated:: Oct 30 2023 11:14:29 PM UTC
Resolved:: Nov 21 2017 09:05:44 PM UTC
Confidence Status Last Update:: 09/Nov/17 7:41 PM

Details

Description

Note on the 'rollbackViaRefetch' algorithm in 3.6:

Attachments

Issue Links

Forms

Activity

People

Dates