Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 3.7.2
Affects Version/s: None
Component/s: Replication, Storage
Labels:
- rollback-functional

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Sprint:
Repl 2018-02-26
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When replication runs in a mode containing only one voting member, its commit point, more-or-less, tracks the primary's last applied optime.

However, this algorithm does not consider in-flight operations, which may have already been assigned an earlier optime. In these cases, it's possible for replication to advance the local "last applied" optime to T, and then learn of an operation that completes at T-1. In the constrained scenario of one voting node, this means replication can advance the commit point to T before T-1 commits in the storage layer. The problem can be restated as, replication's commit point does not respect oplog visibility.

When there are multiple voting nodes, those replicating nodes only learn of operation T after T-1 has become visible, thus preventing the commit point from advancing in the face of concurrent operations.

It's unclear if this premature setting of the commit timestamp breaks any assumptions within replication. At a low level, earlier commit timestamps is documented as a result of heartbeats coming in out of order. However, storage is sensitive to storage transactions committing at a time before the oldest_timestamp (the replica set commit point), and in turn intentionally lag the oldest_timestamp.

If replications feels this is correct behavior, this ticket would need to turn into a storage ticket such that consumers of setStableTimestamp process the input against the oplog read timestamp before propagating the [stable timestamp/oldest timestamp] value to WiredTiger.

is depended on by

SERVER-29213 Have KVWiredTigerEngine implement StorageEngine::recoverToStableTimestamp

Closed

Assignee:: Daniel Gottlieb (Inactive)
Reporter:: Daniel Gottlieb (Inactive)
Participants:: Daniel Gottlieb, Githook User
Votes:: 0 Vote for this issue
Watchers:: 6 Start watching this issue

Created:: Feb 05 2018 05:29:54 PM UTC
Updated:: Oct 29 2023 10:35:05 PM UTC
Resolved:: Feb 12 2018 05:33:55 PM UTC
Confidence Status Last Update:: 12/Feb/18 3:04 PM

Details

Description

Attachments

Issue Links

Activity

People

Dates