Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.1.9
Affects Version/s: None
Component/s: Replication
Labels:
- open_todo_in_code
- prepare_durability

Backwards Compatibility:
Fully Compatible
Sprint:
Repl 2018-12-17, Repl 2019-01-14, Repl 2019-01-28, Repl 2019-02-11, Repl 2019-02-25, Repl 2019-03-11
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

After the work for ~~SERVER-35879~~ goes in, we will already have a way to apply prepare oplog entries during replication recovery. This work includes iterating over the transactions table, finding which sessions had a prepared transaction on them, and applying the prepare oplog entry.

Before we apply these oplog entries, however, we will need to correctly refresh the session state as well as the state of the transaction participant because these will have both been invalidated at the beginning of replication rollback. This must happen before we try to modify the transaction participant (i.e. call unstashTransactionResources or prepareTransaction). There are a couple ways that we can approach this.

First, we could thread a boolean through _recoverFromOplog, _reconstructPreparedTransactions, and applyRecoveredPrepareTransaction. Once we get to applyRecoveredPrepareTransaction, we can check to see if we are recovering from a rollback and refresh the session and transaction participant states.

The second option is to check the OplogApplication mode and if its in OplogApplication::Mode::kRecovering, then refresh the session and transaction participant. Since kRecovering applies to startup recovery AND replication recovery, this would only work if it's safe to do this during startup recovery. During replication recovery, we would not be making any writes to the transactions table, so refreshing the state from disk would not cause us to read those writes and start a new transaction. If the same thing applies to startup recovery, this could be a more elegant solution than the first.

Finally, in both solutions, we would need to introduce a new helper (something like refreshTxnParticipantFromTable) that reconstructs the state of the transaction participant before we cleared it for rollback. This information should be available from the prepare oplog entry.

We would test this via jstests since we would need to induce a rollback and ensure that we have not lost any prepared transactions by the end of the recovery process.

depends on

SERVER-35879 Add support for reconstituting transactions in their correct state from the transaction table during startup recovery

Closed

SERVER-38865 Create rollback test fixture that is compatible with prepared transactions

Closed

is depended on by

SERVER-39762 Fix fastcount after rollback recovery of prepared transactions.

Closed

SERVER-37886 Remove config server as coordinator crutch from coordinator stepdown targeted tests

Closed

Assignee:: Pavithra Vetriselvan
Reporter:: Gregory McKeon (Inactive)
Participants:: Githook User, Gregory McKeon, Jack Mulrow, Judah Schvimer, Pavithra Vetriselvan, Samyukta Lanka
Votes:: 0 Vote for this issue
Watchers:: 6 Start watching this issue

Created:: Jun 28 2018 03:53:35 PM UTC
Updated:: Oct 29 2023 10:30:17 PM UTC
Resolved:: Feb 28 2019 03:40:17 PM UTC
Confidence Status Last Update:: 10/Dec/18 4:52 PM

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates