Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Storage Execution
Operating System:
ALL
Steps To Reproduce:

Hide

Apply attached create_index_operation_time.diff and run test.

Show
Apply attached create_index_operation_time.diff and run test.
Sprint:
Storage Execution 2025-08-04, Storage Execution 2025-08-18, Storage Execution 2025-09-01
Linked BF Score:
200
Confidence Status:
None
Work Order:
3
Size Category:
TBD
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When a createIndex is a no-op, and this has been detected while attempting to start the index build, as opposed to the pre-start checks, the createIndex command returns an operationTime that is earlier than the commitIndexBuild timestamp (even though it correctly waits for the index build thread to finish).

In general, we have in place a mechanism in the SEP to bump the current operation's opTime to the system's last opTime, when we detect the operation is a write but resulted in a no-op.

This mechanism relies on the current operationTime being different than the opTime before starting, but createIndexes bumps the current operation's opTime in case of failure while running IndexBuildsCoordinator::startIndexBuild, and does so before waiting for the index build thread to finish.

After waiting for the build to finish, the code is structured in such a way that we execute the same function to generate the reply. But the second time we won't execute the code path that bumps the operationTime. Afterwards, when the createIndexes command goes through the SEP code which usually bumps the operationTime in case of no-op, given that the lastOpAfterRun is already different than lastOpBeforeRun, nothing is done. Thus returning an operationTime which predates the commit timestamp of the index.

The above means that waiting for write concern may be done with an incorrect timestamp, and that causally consistent sessions which rely on the operationTime might not work as expected.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

create_index_operation_time.diff
3 kB
Jul 14 2025 01:24:47 PM UTC

Assignee:: Thomas Goyne
Reporter:: Yujin Kang Park
Participants:: Thomas Goyne, Yujin Kang Park
Votes:: 0 Vote for this issue
Watchers:: 5 Start watching this issue

Created:: Jul 14 2025 01:58:21 PM UTC
Updated:: Aug 29 2025 09:52:40 PM UTC

Details

Description

Attachments

Attachments

Forms

Activity

People

Dates