Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Won't Do
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Layered Tables
Security Level: Public (Available to anyone on the web)
Labels:
- lc_bulk_04_29_26

Assigned Teams:

Storage Engines - Foundations
Total Hours with Assigned Team:
1,101.812
Sprint:
None
Story Points:
None

When we open a layered cursor on a follower we check for the last checkpoint and if there hasn't been any checkpoint yet we open a live btree for an empty stable table which is fine. The logic looks like:

WT_ERR_NOTFOUND_OK(
__wt_meta_checkpoint_last_name(session, stable_uri, &checkpoint_name, NULL, NULL), true);


if (ret == WT_NOTFOUND) {
    // open live btree
}

However there might be a race when checkpoint arrives right after __wt_meta_checkpoint_last_name so we end up opening a shared live btree for a checkpoint on a follower which could lead to a data corruption, if we have a step up event right after we do this so we then write this old checkpoint to the disk.

This ticket scope is to introduce a production assertion to double check that checkpoint hasn't arrived after we open a live btree. This assertion will turn this problem from being a data corruption to a an availability problem.

is related to

WT-16476 Never open a live btree handle on the stable table on standby even if there is no checkpoint

Closed

related to

WT-17244 Assertion failure in ASAN variant when follower reads from stable cursor

Closed

Assignee:: Ivan Kochin
Reporter:: Ivan Kochin
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Apr 22 2026 01:11:51 AM UTC
Updated:: May 04 2026 11:35:31 PM UTC
Resolved:: May 01 2026 03:20:51 AM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates