-
Type: Bug
-
Resolution: Fixed
-
Priority: Major - P3
-
Affects Version/s: None
-
Component/s: Replication
-
None
-
Fully Compatible
-
ALL
-
v4.2, v4.0
-
Repl 2019-08-26
SERVER-33812 attach afterClusterTime to all oplog queries. A node with higher timestamp but lower term than the sync source should roll back due to an empty batch, e.g. the old primary has (ts: 9, term: 1), while the new primary has (ts: 8, term: 2). However, the oplog query failed with MaxTimeMSExpired added in SERVER-35200. I believe the query times out while waiting for afterClusterTime. In production, it's very likely the old primary will roll back when new writes arrive with even higher timestamp, maybe by the periodic no-op writer. However, it is still a liveness issue.
- is related to
-
SERVER-42219 Oplog buffer not always empty when primary exits drain mode
- Closed
- related to
-
SERVER-33812 First initial sync oplog read batch fetched may be empty; do not treat as an error.
- Closed
-
SERVER-35200 Speed up failure detection in the OplogFetcher during steady state replication
- Closed