Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.1.5
Affects Version/s: None
Component/s: None
Labels:
- prepare_errors

Backwards Compatibility:
Fully Compatible
Sprint:
Repl 2018-08-27, Repl 2018-09-10, Repl 2018-09-24, Repl 2018-10-08, Repl 2018-10-22, Repl 2018-11-05, Repl 2018-11-19
Linked BF Score:
56
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

In order to complete this ticket, we will first need to build on the work done in ~~SERVER-35873~~. First, we should preserve OpTimes instead of Timestamps since we calculate the stable timestamp using OpTimes. Next, we want to keep a separate data structure that tracks all "oldest oplog entry OpTimes" for each transaction whose corresponding abort/commit oplog entry has not been majority committed. To do this, when we remove an "oldest active OpTime" from oldestActiveOplogEntryOpTimes, we will also add it to oldestActiveUncommittedOpTimes along with its corresponding "finishOpTime" (which is the OpTime of the commit/abort oplog entry).

oldestActiveUncommittedOpTimes will be a set of OpTime pairs -> (oldest active OpTime per transaction, corresponding commit/abort oplog entry OpTime).

This should be enough information to set the stable timestamp back to the "oldest active OpTime of transactions whose corresponding commit/abort oplog entries have not been majority committed". Let's refer to this as the "oldest active uncommitted txn OpTime" for now. In ReplicationCoordinatorImpl, we have a function that calculates the stable OpTime. Instead of just calculating the min of the "all committed timestamp" and the current "commit point," we will also add the "oldest active uncommitted txn Timestamp" to this comparison and take the min of all three.

The last thing we need to do is properly remove an OpTime pair once the commit point advances past an entry's commit/abort oplog entry. This would mean that the entry is majority committed and we are able to move the stable timestamp.

Since this code will be separate from the code maintaining the oldestActiveOpTime, it should be simpler to remove once we no longer need to hold the stable timestamp back.

depends on

SERVER-35873 Maintain the oldest oplog entry timestamp of any active transaction

Closed

is depended on by

SERVER-36782 WT erring because commit timestamp is older than stable timestamp

Closed

SERVER-35877 Secondaries commit transactions when applying commitTransaction oplog entries in their own batch

Closed

SERVER-36023 Try to enable replica set transaction testing with the inMemory WT storage engine

Closed

is related to

SERVER-38302 Committing or aborting prepared transactions may fail to un-pin stable timestamp

Closed

Assignee:: Pavithra Vetriselvan
Reporter:: Gregory McKeon (Inactive)
Participants:: Githook User, Gregory McKeon, Pavithra Vetriselvan
Votes:: 0 Vote for this issue
Watchers:: 7 Start watching this issue

Created:: Jun 26 2018 03:56:29 PM UTC
Updated:: Oct 29 2023 10:30:23 PM UTC
Resolved:: Nov 06 2018 06:58:11 PM UTC
Confidence Status Last Update:: 30/Aug/18 5:50 PM

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates