Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.1.0-rc0, 8.0.0-rc5
Affects Version/s: 5.0.0, 6.0.0, 7.0.0, 8.0.0-rc0, 7.3.0
Component/s: None
Labels: None
Assigned Teams: Catalog and Routing
Backwards Compatibility: Fully Compatible
Operating System: ALL
Backport Requested: v8.0, v7.0, v6.0, v5.0
Sprint: CAR Team 2024-02-05, CAR Team 2024-02-19, CAR Team 2024-03-04, CAR Team 2024-03-18, CAR Team 2024-04-01, CAR Team 2024-04-29, CAR Team 2024-05-13, CAR Team 2024-05-27
Linked BF Score: 200
Story Points: 2
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

In some cases, when a shard detects that the filtering metadata is not properly installed, it fails the current execution of the command, forces a refresh, and then re-executes the original command. If that refresh fails, the error bubbled up to the router is the original StaleConfig error, not the error returned by the refresh. In general this behavior makes sense: the router retries the whole command and the shard repeats the same steps, in the hope that the refresh succeeds this time.

Bubbling up StaleConfig instead of the actual error can be problematic in some cases, though. Suppose the refresh fails with a NotPrimary exception. If the shard bubbles up a StaleConfig exception instead, the router won't realize that the primary has changed, and it may re-execute the command with exactly the same outcome until it exhausts its retries.
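The difference between the two behaviors can be illustrated with a minimal Python sketch. This is not the server code: `error_after_refresh`, `failing_refresh`, and the `bubble_up_refresh_failure` parameter are hypothetical names invented for illustration, and the exception classes are simple stand-ins for the StaleConfig and NotPrimary errors mentioned above. The sketch assumes that the intended fix is to report the refresh failure itself, as the flag name in the linked cleanup ticket (featureFlagBubbleUpOriginalRefreshFailure) suggests.

```python
class StaleConfig(Exception):
    """Stand-in for the StaleConfig error a shard reports to the router."""


class NotPrimary(Exception):
    """Stand-in for a NotPrimary error raised while refreshing metadata."""


def error_after_refresh(refresh, bubble_up_refresh_failure):
    """Return the error the shard would report to the router, or None.

    `refresh` is a callable modelling the forced filtering metadata refresh.
    `bubble_up_refresh_failure` is a hypothetical switch between the behavior
    described in this ticket (False) and the assumed fixed behavior (True).
    """
    try:
        refresh()
    except Exception as refresh_error:
        if bubble_up_refresh_failure:
            # Assumed fix: report the real cause of the failed refresh.
            return refresh_error
        # Behavior described above: the refresh failure is hidden behind
        # the StaleConfig error that triggered the refresh.
        return StaleConfig("filtering metadata is not installed")
    # Refresh succeeded, so the command can be re-executed.
    return None


def failing_refresh():
    # Hypothetical refresh that fails because the node is no longer primary.
    raise NotPrimary("node stepped down during refresh")


old = error_after_refresh(failing_refresh, bubble_up_refresh_failure=False)
new = error_after_refresh(failing_refresh, bubble_up_refresh_failure=True)
print(type(old).__name__)  # StaleConfig
print(type(new).__name__)  # NotPrimary
```

With the first behavior the router only ever sees StaleConfig and has no signal that the primary changed; with the second it receives NotPrimary and can react to it.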

causes

SERVER-117486 Shard overwrites bulk write response payload when metadata refresh is interrupted

Blocked

SERVER-90809 Test forcing filtering metadata refresh should tolerate transient client request failures

Closed

fixes

SERVER-69110 Consolidate stale collection/database exception handling on shards

Closed

is caused by

SERVER-78115 Shard primaries must commit a majority write before using new routing information from the config server

Closed

is depended on by

SERVER-86514 Cleanup featureFlagBubbleUpOriginalRefreshFailure once 8.0 releases

Closed

related to

SERVER-118652 Investigate prioritizing replication status check over versioning protocol

Backlog

SERVER-117235 Investigate ExceededTimeLimit retriability in multi-document transactions when waiting for refreshes

Closed

(2 related to)

Assignee: Pol Pinol
Reporter: Sergi Mateo Bellido
Participants: Githook User, Pol Pinol, Sergi Mateo Bellido
Votes: 0
Watchers: 8

Created: Jan 08 2024 08:28:42 AM UTC
Updated: Feb 02 2026 09:51:58 AM UTC
Resolved: May 13 2024 03:46:26 PM UTC
