Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 7.1.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Storage Execution
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Sprint:
Execution Team 2023-03-06, Execution Team 2023-03-20, Execution Team 2023-04-17
Linked BF Score:
4

It looks like a time-series insert can return a plain 'Interrupted' error code after a stepdown (at least in certain circumstances) rather than the expected 'InterruptedDueToReplStateChange', resulting in the operation not being retried.

This resulted in BF-25238. We have a band-aid fix (~~SERVER-67533~~) to exclude the test from the relevant concurrency passthroughs for the time being, but we would like to get to the root cause. It only seems to affect time-series inserts, not regular collections, and it only seems to happen in the sharded concurrency passthroughs with stepdowns involved.

In the continuous stepdown suites, you get the 'Interrupted' error code with a default error message. In the kill_primary suites, you get the 'Interrupted' error code with a more descriptive error message about read preference, like "Write results unavailable from failing to target a host in the shard shard-rs0 :: caused by :: Could not find host matching read preference { mode: \"primary\" } for set shard-rs0".

Assignee:: Dianna Hohensee (Inactive)

Reporter:: Dan Larkin-York

Participants:: Dan Larkin-York, Dianna Hohensee, Githook User, Sergi Mateo Bellido

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Created:: Jun 28 2022 04:47:00 PM UTC

Updated:: Oct 29 2023 09:36:18 PM UTC

Resolved:: Apr 12 2023 12:57:58 PM UTC

Details

Description

Attachments

Activity

People

Dates