Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 6.0.1, 6.1.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v6.0
Sprint:
Execution Team 2022-06-27
Linked BF Score:
0
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

TL;DR the periodic thread updating oplogReadTimestamp doesn't have sufficient mutex coverage to avoid immediately setting the oplogReadTimestamp forward again after cappedTruncateAfter does a direct oplogReadTimestamp update backwards. If the timing is just right.

-----------------------------------------------------------------
This is my explanation from the test failure ticket:

RecordStore::cappedTruncateAfter has special logic to update the oplogReadTimestamp if it's the record store for the oplog collection. Meanwhile, there's a thread that periodically updates the oplogReadTimestamp. Of note in the thread's logic, it releases the mutex protecting oplogReadTimestamp writes/reads while fetching the WT all_durable timestamp. So here's what I propose happened:

1. The oplogReadTimestamp is T(5,30)
2. PeriodicThread fetches the all_durable timestamp T(5,30)
3. Op1 truncates the oplog back to T(5,1), deleting T(5,20) & T(5,30)
4. Op1 then sets the oplogReadTimestamp to T(5,3)
5. PeriodicThread then moves the oplorReadTimestamp forward to T(5,30)

So in theory, any internal operation truncating the oplog while the server is up and running (not startup or rollback) could cause this race. If such code exists anywhere. Startup and rollback both restart the storage engine, reseting the all_durable timestamp, and do not have this issue with oplog truncation.

Assignee:: Dianna Hohensee (Inactive)
Reporter:: Dianna Hohensee (Inactive)
Participants:: Dianna Hohensee, Githook User
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: May 17 2022 06:58:29 PM UTC
Updated:: Oct 29 2023 09:38:09 PM UTC
Resolved:: Jun 14 2022 04:56:47 PM UTC
Confidence Status Last Update:: 10/Jun/22 5:01 PM

Details

Description

Attachments

Forms

Activity

People

Dates