Type: Improvement
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 7.1.0-rc0, 6.0.10, 5.0.21, 7.0.2
Affects Version/s: 5.0.0, 6.0.0, 7.0.0, 7.1.0-rc0
Component/s: Sharding
Labels: sharding-nyc-subteam1
Assigned Teams: Sharding NYC
Backwards Compatibility: Fully Compatible
Backport Requested: v7.0, v6.0, v5.0
Sprint: Sharding NYC 2023-09-04
Case:
Confidence Status: None
Work Order: 3
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

Pasting Max's findings:

The problematic area is in https://github.com/mongodb/mongo/blob/r5.0.19/src/mongo/db/s/resharding/resharding_oplog_fetcher.cpp#L202-L203. When the code was written, it was likely assumed that because the function returns a StatusWith<> result it would never throw an exception, yet it appears the function can also throw. When it does, the exception propagates out as an error instead of being swallowed and retried via the return true path.

The ReshardingRecipientService should also retry on transient errors in the NetworkTimeoutError category in every retry loop. Since the change will be made in resharding_future_util.h, this improvement should affect all code that uses resharding::withAutomaticRetry.
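
To illustrate the failure mode described above, here is a minimal, self-contained sketch of a retry loop that treats a thrown exception the same way as a returned error status. Everything in it (Status, ErrorCode, CodedException, isTransient, runWithRetry) is a hypothetical stand-in invented for this example; it is not the code in resharding_future_util.h and does not reflect how resharding::withAutomaticRetry is actually implemented.

```cpp
#include <functional>
#include <stdexcept>
#include <string>
#include <utility>

// Hypothetical stand-ins for illustration only; these are not MongoDB types.
enum class ErrorCode { OK, NetworkTimeout, Fatal };

struct Status {
    ErrorCode code;
    std::string reason;
    Status() : code(ErrorCode::OK) {}
    Status(ErrorCode c, std::string r) : code(c), reason(std::move(r)) {}
    bool isOK() const { return code == ErrorCode::OK; }
};

// An exception that carries an error code, mimicking a function that
// nominally returns a status but can also throw.
struct CodedException : std::runtime_error {
    ErrorCode code;
    CodedException(ErrorCode c, const std::string& msg)
        : std::runtime_error(msg), code(c) {}
};

// Assumption: only network timeouts count as transient in this sketch.
bool isTransient(ErrorCode c) {
    return c == ErrorCode::NetworkTimeout;
}

// Runs op up to maxAttempts times. A thrown CodedException is converted to a
// Status first, so the same transient check governs both failure paths.
Status runWithRetry(const std::function<Status()>& op, int maxAttempts) {
    Status last(ErrorCode::Fatal, "no attempts made");
    for (int attempt = 0; attempt < maxAttempts; ++attempt) {
        try {
            last = op();
        } catch (const CodedException& ex) {
            last = Status(ex.code, ex.what());
        }
        if (last.isOK() || !isTransient(last.code)) {
            return last;  // success or non-retryable failure
        }
    }
    return last;  // transient failure persisted through every attempt
}
```

The point of the sketch is the try/catch inside the loop: without it, a timeout delivered as an exception would escape the loop on the first attempt, while the same timeout delivered as a returned status would be retried.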

is related to
- SERVER-58389 Capture NetworkInterfaceExceededTimeLimit and MaxTimeMSExpired errors in resharding participants (Closed)
- SERVER-72055 NetworkInterfaceTL should by default return a retryable error when it times out waiting to acquire a connection (Closed)

related to
- SERVER-80020 The exhaustiveFindOnConfig() method should retry on NetworkInterfaceExceededTimeLimit errors (Backlog)

Assignee: Abdul Qadeer
Reporter: Abdul Qadeer
Participants: Abdul Qadeer, Githook User, Max Hirschhorn
Votes: 0
Watchers: 4

Created: Aug 05 2023 04:15:51 AM UTC
Updated: Oct 29 2023 09:17:54 PM UTC
Resolved: Aug 28 2023 05:58:48 PM UTC
Confidence Status Last Update: 24/Aug/23 3:45 PM
