Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Catalog and Routing
Operating System:
ALL
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

In HELP-54194, we discovered that some commands may fail while a removeShard is in progress, that is, while a shard is draining. One example is "listIndexes". Within a specific window of time, and only with specific timing, the command can hit ShardNotFound as a transient error, which then bubbles up to the user application. The failed command can be retried as-is and then executes successfully. The exact reproduction is attached to this ticket.

The problem is that the user sees ShardNotFound bubble up when that may not be necessary; instead, the mongos or the driver (an implementation decision) should retry the operation.

In summary, the goal of this ticket is to list all the commands that fail under the reproduction, and to investigate and work on a feasible way to retry on ShardNotFound without bubbling it up to the user when that is not necessary, as we do with other transient errors.
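To illustrate the intended behavior, here is a minimal, self-contained Python sketch of treating ShardNotFound as a retryable transient error. It is not the mongos or driver implementation, and the ticket leaves open which layer should own the retry. The names CommandError and run_with_retry, the attempt limit, and the backoff are hypothetical, and the numeric code 70 for ShardNotFound is an assumption that should be checked against the server's error code list.

```python
import time

# Assumption: ShardNotFound is identified by numeric error code 70.
SHARD_NOT_FOUND = 70


class CommandError(Exception):
    """Hypothetical stand-in for a server error that carries a numeric code."""

    def __init__(self, code, message=""):
        super().__init__(message)
        self.code = code


def run_with_retry(operation, retryable_codes=frozenset({SHARD_NOT_FOUND}),
                   max_attempts=3, backoff_seconds=0.0):
    """Call operation(), retrying only when it fails with a retryable code.

    Non-retryable errors, and the error from the final attempt, are re-raised
    so the caller still sees failures that are not transient.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except CommandError as exc:
            if exc.code not in retryable_codes or attempt == max_attempts:
                raise
            # Linear backoff gives the draining shard's routing info time to refresh.
            time.sleep(backoff_seconds * attempt)
```

In this sketch a caller would wrap the command, for example run_with_retry(lambda: list_indexes(coll)), where list_indexes is whatever function issues the command. The retry is bounded so that a shard that is genuinely missing still surfaces as an error.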


repro.patch
4 kB
Jan 12 2024 02:48:18 PM UTC

Assignee: [DO NOT USE] Backlog - Catalog and Routing
Reporter: Pol Pinol
Participants: [DO NOT USE] Backlog - Catalog and Routing, Pol Pinol
Votes: 0
Watchers: 9

Created: Jan 12 2024 02:42:41 PM UTC
Updated: Jun 05 2024 01:28:00 PM UTC
