Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Operating System:
ALL
Backport Requested:

v5.0, v4.4, v4.2
Sprint:
Repl 2021-06-28, Repl 2021-07-12, Repl 2021-07-26
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

It is possible for a stepdown to start due to some other primary stepping up while we are still holding the RSTL from a step-up attempt. If we do this while we have a transaction prepared, we will uassert when trying to check out a session to restore the prepared transactions locks.

https://github.com/mongodb/mongo/blob/b9c4dc61d38edd4ae1c4953dbc646fac633d78d0/src/mongo/db/session_catalog_mongod.cpp#L271

The uassert will cause use to exit signalDrainComplete() without actually signalling that the drain is complete. At that point the oplog applier (and thus replication) will be stuck.

In addition to fixing this, we should probably mark signalDrainComplete() as "noexcept" so we crash instead of hanging if anything similar happens.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

repro.SERVER-54545
Jun 17 2021 05:57:45 PM UTC
10 kB
Matthew Russotto

duplicates

SERVER-57756 Race between concurrent stepdowns and applying transaction oplog entry

Closed

is related to

SERVER-58440 Mark signalDrainComplete as noexcept

Closed

Assignee:: Vesselina Ratcheva (Inactive)
Reporter:: Matthew Russotto
Participants:: Matthew Russotto, Vesselina Ratcheva
Votes:: 0 Vote for this issue
Watchers:: 8 Start watching this issue

Created:: Jun 08 2021 10:31:53 PM UTC
Updated:: Jul 13 2021 05:57:19 PM UTC
Resolved:: Jul 13 2021 05:57:07 PM UTC

Details

Description

Attachments

Attachments

Issue Links

Forms

Activity

People

Dates