Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Duplicate
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 5.0.0, 6.0.0, 7.0.0, 7.1.0-rc2
Component/s: None
Labels:
None

Assigned Teams:

Sharding NYC
Operating System:
ALL
Linked BF Score:
0
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

If resharding fails due to any error that is not user initiated (and hence user won't expect ReshardCollectionAborted), followed by config server step down + step up, we recover the abort decision at the newly elected config server by checking the state document and signaling the context holder to abort here. When doing this we overwrite the status as ReshardCollectionAborted here incorrectly thinking it is user-initiated. Note that the original status at the previous config server primary's ReshardingCoordinatorService is present in memory in this onError handler in code.

When checking if the context holder is aborted, we should additionally check if it was user-initiated and return the right status code.

duplicates

SERVER-73897 Resharding coordinator returns generic abort error after recovery from stepdown

Closed

Assignee:: [DO NOT USE] Backlog - Sharding NYC
Reporter:: Abdul Qadeer
Participants:: [DO NOT USE] Backlog - Sharding NYC, Abdul Qadeer
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Sep 09 2023 01:49:48 AM UTC
Updated:: Apr 17 2024 05:50:15 PM UTC
Resolved:: Sep 15 2023 03:25:18 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates