- Type: Bug
- Resolution: Fixed
- Priority: Major - P3
- Affects Version/s: 4.0.0-rc5, 4.1.1
- Component/s: Replication, Sharding
- Fully Compatible
- ALL
- v4.0
- Sharding 2018-08-13, Sharding 2018-09-10, Sharding 2018-09-24, Sharding 2018-10-08, Sharding 2018-10-22, Sharding 2018-11-05, Sharding 2018-12-17, Sharding 2018-12-31, Sharding 2019-01-14, Sharding 2019-01-28
- 64
If a node crashes with unapplied oplog entries, then when it starts back up it will apply entries up to the end of its oplog through ReplicationRecoveryImpl::recoverFromOplog. This applies the entries by directly calling SyncTail::multiApply (through an OplogApplier), which, unlike normal secondary application, does not update the logical clock. Then, when starting up its replication coordinator, the node asynchronously schedules ReplicationCoordinatorImpl::_finishLoadLocalConfig, which updates the logical clock only after it has updated the replication coordinator's lastAppliedOpTime to the opTime of the latest oplog entry.
If a request is processed during this window in _finishLoadLocalConfig, then when the node goes to compute logical time metadata for the response, it can hit this invariant, because the operationTime, which is typically the lastAppliedOpTime, will be greater than the latest time in the logical clock.
Two ways to fix this would be: (1) have replication recovery update the logical clock as it applies the unapplied oplog entries, or (2) update the global logical clock before updating lastAppliedOpTime when finishing loading the local replica set config.
- is related to:
  - SERVER-46257 OplogFetcher should run LogicalTimeMetadataHook on reply metadata (Closed)
  - SERVER-46308 Investigate dependency between commit point (lastCommitted) and cluster time (Closed)