Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 3.7.3
Affects Version/s: None
Component/s: Replication
Labels:
- rollback-functional

Backwards Compatibility:
Fully Compatible
Sprint:
Repl 2018-02-26
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

KVStorageEngine implementations have their catalog persisted as "yet another" record store named the `_mdb_catalog`. For storage engines that support `recoverToStableTimestamp`, this table is not journaled, meaning it's only persisted when a stable checkpoint is taken, or from create collection oplog entries being replayed on replication recovery at startup.

Replication, naturally, does not replicate its internal collections which can lead to the following sequence:

Exit initial sync at time T. T is also the stable timestamp.
Node becomes a secondary.
Create the `oplogTruncateAfterPoint` collection.
Begin processing a patch, performing a write to the `oplogTruncateAfterPoint`.
The node crashes. The `oplogTruncateAfterPoint` document is required to correctly recover.
Node restarts.
MongoDB sees a storage engine table without a corresponding MongoDB collection, the table gets removed.
Replication recovery plays. Assumes there was no `oplogTruncateAfterPoint`, resulting in data corruption.

Explicitly creating `oplogTruncateAfterPoint` before coming out of initial sync is sufficient to guarantee that if a node starts up and decides it has completed initial sync, then the `oplogTruncateAfterPoint` collection will exist.

Assignee:: Daniel Gottlieb (Inactive)
Reporter:: Daniel Gottlieb (Inactive)
Participants:: Daniel Gottlieb, Githook User
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Feb 11 2018 02:44:36 AM UTC
Updated:: Oct 29 2023 10:34:51 PM UTC
Resolved:: Feb 16 2018 09:24:23 PM UTC

Details

Description

Attachments

Activity

People

Dates