Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Works as Designed
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Replication
Labels:
None

Assigned Teams:

Replication
Operating System:
ALL
Backport Requested:

v3.6
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The last phase of a secondary performing initial sync is to apply oplog operations up through some time `T` representing when the collection cloning phase completed. It's incorrect for a secondary to respond to majority read/at a timestamp queries before time T.

When a secondary comes out of initial sync, it will still have a notion of the replica sets majority commit time. Because the majority commit time is translated to a "read at a timestamp", the secondary will incorrectly respond to a query, but with a view of inconsistent data.

A couple starting points for solutions:

An API was introduced for recover to a stable timestamp known as the "initial data timestamp" that replication sets when initial sync completes. This represents the timestamp at which the data is in a consistent state. This could be used to reject/block incoming majority reads/read at a timestamp requests.
Alternatively, a secondary can refuse to come out of initial sync until the majority commit point passes `T`. Currently there is no mechanism to tell drivers which timestamps a node can service reads for. This solution would be a way to signal to drivers to not send majority reads the node cannot service, at the cost of not participating in reads `>= T`.

is depended on by

SERVER-30809 Investigating remaining writes to the [KV]Catalog that must be timestamped.

Closed

is related to

SERVER-32226 oldest_timestamp should track the last applied time, during initial sync

Closed

SERVER-30577 Clear list of stable timestamp candidates on Rollback and Initial Sync

Closed

related to

SERVER-32237 Nodes that cannot become primary must neither update progress nor vote "aye"

Closed

Assignee:: [DO NOT USE] Backlog - Replication Team
Reporter:: Daniel Gottlieb (Inactive)
Participants:: [DO NOT USE] Backlog - Replication Team, Daniel Gottlieb, Eric Milkie, Judah Schvimer
Votes:: 0 Vote for this issue
Watchers:: 8 Start watching this issue

Created:: Dec 06 2017 03:31:26 PM UTC
Updated:: Oct 27 2023 01:54:04 PM UTC
Resolved:: Dec 08 2017 09:51:51 PM UTC

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates