Type: Task
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels: None
Assigned Teams: Replication
Case:
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

There's an interleaving between an election and oplog fetching that makes it possible for a majority of nodes in a replica set to go into rollback. Here's an example:

Node 0 is primary and it is at (timestamp: 10, term: 1)
Node 2 decides to run for election and its last applied is (timestamp: 10, term: 1)
Node 1 receives a vote request from node 2 and says yes because its last applied is also (timestamp: 10, term: 1)
Node 2 wins the election and starts primary catchup; based on heartbeats, its target optime is (timestamp: 10, term: 1). It writes a new term oplog entry at (timestamp: 11, term: 2)
Node 0 accepts a write at (timestamp: 12, term: 1)
Node 1 replicates the write at (timestamp: 12, term: 1) from node 0 because it hasn't changed its sync source yet
Before Node 0 hears back from Node 1, Node 0 steps down and tries to sync from node 2, but realizes it needs to roll back (timestamp: 12, term: 1)
Node 1 syncs from node 2 and realizes it needs to roll back (timestamp: 12, term: 1)
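
The interleaving above can be replayed with a small model. This is only an illustrative sketch, not the server's logic: it assumes node 2 is the node that wins the election, it represents each oplog as a plain list of (timestamp, term) pairs, and the helper `needs_rollback` is a hypothetical simplification that flags a node whenever its oplog disagrees with its sync source's oplog at a position both of them have.

```python
from typing import Dict, List, Tuple

OpTime = Tuple[int, int]  # (timestamp, term), in the order used in the example


def needs_rollback(local: List[OpTime], source: List[OpTime]) -> bool:
    """Hypothetical, simplified check: the local oplog has diverged if it
    differs from the source's oplog somewhere in their shared range."""
    shared = min(len(local), len(source))
    return local[:shared] != source[:shared]


# All three nodes start at (timestamp: 10, term: 1).
oplogs: Dict[int, List[OpTime]] = {0: [(10, 1)], 1: [(10, 1)], 2: [(10, 1)]}

oplogs[2].append((11, 2))  # node 2 wins and writes the new-term entry
oplogs[0].append((12, 1))  # node 0, still primary in term 1, accepts a write
oplogs[1].append((12, 1))  # node 1 replicates it from its old sync source

new_primary = 2  # assumption: node 2 is the election winner
rolling_back = [
    node
    for node in oplogs
    if node != new_primary and needs_rollback(oplogs[node], oplogs[new_primary])
]
majority = len(oplogs) // 2 + 1

print(rolling_back)                    # [0, 1]
print(len(rolling_back) >= majority)   # True
```

In this model, node 1 would not have needed rollback had it switched sync sources before fetching (timestamp: 12, term: 1), which is why the timing of the sync source change matters.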

Rollback can be a very slow operation that can take tens of minutes. In this situation, the multiple rollbacks cause write unavailability until at least one of the nodes returns to the secondary state.

We should be careful to make sure that any solutions in this space do not make other, more common, operations much worse.

Assignee: Unassigned
Reporter: Samyukta Lanka
Participants: Samyukta Lanka
Votes: 1
Watchers: 8

Created: Feb 04 2026 03:26:50 PM UTC
Updated: Feb 09 2026 06:51:12 PM UTC
