Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Critical - P2
Fix Version/s: 5.0.14, 6.0.2, 6.1.0-rc1, 6.2.0-rc0
Affects Version/s: 6.0.0, 5.0.10, 6.1.0-rc1
Component/s: None
Labels:
None

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v6.1
Steps To Reproduce:
Hide

Run this noPassthrough jstest.

/** * @tags: [ * requires_replication, * ] */ (function() { 'use strict'; const replTest = new ReplSetTest({nodes: 2}); replTest.startSet(); replTest.initiate(); const primary = replTest.getPrimary(); const db = primary.getDB('test'); assert.commandWorked(db['c'].insertOne({_id: 5, num: 0})); const s0 = db.getMongo().startSession(); s0.startTransaction(); assert.commandWorked(s0.getDatabase('test')['c'].deleteOne({_id: 5})); s0.commitTransaction(); const clusterTime = s0.getClusterTime().clusterTime; assert.commandWorked(db['c'].createIndex({'value': 1})); // Start a transaction whose snapshot predates the completion of the index build, and which reserves // an oplog entry (i.e. writes) after the index build commits. try { const s1 = db.getMongo().startSession(); s1.startTransaction({readConcern: {level: "snapshot", atClusterTime: clusterTime}}); s1.getDatabase('test').c.insertOne({_id: 5, num: 1}); // Transaction should have failed. assert(0); } catch (e) { assert(e.hasOwnProperty("errorLabels"), tojson(e)); assert.contains("TransientTransactionError", e.errorLabels, tojson(e)); assert.eq(e["code"], ErrorCodes.SnapshotUnavailable, tojson(e)); } replTest.stopSet(); })();
Show
Run this noPassthrough jstest. /** * @tags: [ * requires_replication, * ] */ (function() { 'use strict'; const replTest = new ReplSetTest({nodes: 2}); replTest.startSet(); replTest.initiate(); const primary = replTest.getPrimary(); const db = primary.getDB('test'); assert.commandWorked(db['c'].insertOne({_id: 5, num: 0})); const s0 = db.getMongo().startSession(); s0.startTransaction(); assert.commandWorked(s0.getDatabase('test')['c'].deleteOne({_id: 5})); s0.commitTransaction(); const clusterTime = s0.getClusterTime().clusterTime; assert.commandWorked(db['c'].createIndex({'value': 1})); // Start a transaction whose snapshot predates the completion of the index build, and which reserves // an oplog entry (i.e. writes) after the index build commits. try { const s1 = db.getMongo().startSession(); s1.startTransaction({readConcern: {level: "snapshot", atClusterTime: clusterTime}}); s1.getDatabase('test').c.insertOne({_id: 5, num: 1}); // Transaction should have failed. assert(0); } catch (e) { assert(e.hasOwnProperty("errorLabels"), tojson(e)); assert.contains("TransientTransactionError", e.errorLabels, tojson(e)); assert.eq(e["code"], ErrorCodes.SnapshotUnavailable, tojson(e)); } replTest.stopSet(); })();
Sprint:
Execution Team 2022-07-25, Execution Team 2022-08-08, Execution Team 2022-08-22, Execution Team 2022-09-05
Linked BF Score:
100
Confidence Status:
None
Work Order:
0
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

In the following scenario:

Multi-doc transaction starts, reading from a snapshot @ timestamp (9,1).
(10,1): index build on collection A completes.
(11,1): transaction writes to collection A, which involves updating the index that was built at (10, 1).

At step 3 the transaction doesn't fail with SnapshotUnavailable code name + TransientTransactionError label + "Unable to read from a snapshot due to pending collection catalog changes; please retry the operation" message.
This is a bug that opens the opportunity for a race condition to happen, which can result in data inconsistency as the index key won't get updated.

Adding _indexCatalogEntry->isReady(opCtx) in _indexKeysOrWriteToSideTable resolves the issue: step 3 fails with SnapshotUnavailable, instead of silently progressing with an invalid, stale snapshot.

is related to

SERVER-68573 Remove special handling in index code that handles index to be out-of-sync with snapshot

Closed

SERVER-47866 Secondary readers do not need to reacquire PBWM lock if there are catalog conflicts

Closed

SERVER-68455 Clean up and publish GDB helpers for dumping in-memory representation of WT tables

Closed

Assignee:: Josef Ahmad
Reporter:: Josef Ahmad
Participants:: Githook User, Josef Ahmad
Votes:: 0 Vote for this issue
Watchers:: 10 Start watching this issue

Created:: Jun 27 2022 07:48:19 AM UTC
Updated:: Oct 29 2023 09:36:25 PM UTC
Resolved:: Aug 31 2022 08:11:40 AM UTC
Confidence Status Last Update:: 13/Jul/22 9:57 AM

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates