Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.0.19, 4.2.7, 3.6.19, 4.4.0-rc4, 4.7.0
Affects Version/s: None
Component/s: Replication
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v4.4, v4.2, v4.0, v3.6
Sprint:
Repl 2020-05-18
Linked BF Score:
8
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

If a thread does a write that gets rolled back during stepdown, its client can have _lastOp with timestamp higher than the timestamp of system last opTime (if the wallclock on primary is behind the wallclock on the node the threads runs on). So if after stepdown the thread sends a write command to itself, the command will fail the ReplicationCoordinator check when trying to write an oplog entry but the NotMaster error will get caught in this block in the ServiceEntryPoint::runCommandImpl. Since the command is a noop, the client's lastOp will be set to last system opTime. So after the wait for writeConcern fails, the NotMaster error will get propagated up, and the operation will hit the invariant operationTime >= startOperations when trying append operationTime to the response.

related to

SERVER-30842 Don't try to set last optime for client backwards after rollback

Closed

Assignee:: Lingzhi Deng
Reporter:: Cheahuychou Mao
Participants:: Cheahuychou Mao, Githook User, Lingzhi Deng
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Apr 22 2020 04:35:30 AM UTC
Updated:: Oct 29 2023 10:09:12 PM UTC
Resolved:: May 04 2020 04:43:59 PM UTC