Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 8.0.0, 8.2.0
Component/s: None
Labels:
None

Assigned Teams:

Catalog and Routing
CAR Domain/s:

🟦 Shard Catalog

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When a shard executing a bulk write consisting of multiple operations fails some of them, for example due to StaleConfig error, it produces an ok:1 response with a payload indicating which writes succeeded vs which ones did not. This allows correct reporting of the operation outcome and also allows mongos to perform retries when it is safe to do so.

Before responding to the mongos, shards attempt to recover the sharding metadata. However, if that fails due to an Interruption error, shards overwrite the ok: 1 response and instead throw top-level ok: 0 response. This causes the detailed per-operation outcome to be lost, which makes the mongos unable to determine the appropriate reties. That error is then propagated to the driver, without any information about what operations succeeded vs failed.

In the case of retryableWrites=true, the driver is able to retry safely the whole operation, so this is transparent to the app, although with some inefficiency due to retrying operations that definitely had succeed already.

In the case of retryableWrites=false, the driver is not able to retry and the app simply gets a top-level error that doesn't report the individual writes outcomes.

Shards should avoid discarding the response payload indicating the individual write operation outcomes.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

0001-repro.patch
Jan 21 2026 01:15:51 PM UTC
6 kB
Jordi Serra Torrens

depends on

SERVER-118652 Investigate prioritizing replication status check over shard versioning protocol

Backlog

is caused by

SERVER-84623 Shard-local re-execution of a command might bubble up a misleading StaleConfig exception to the router

Closed

Assignee:: Unassigned
Reporter:: Jordi Serra Torrens
Participants:: Jordi Olivares Provencio, Jordi Serra Torrens
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Jan 21 2026 01:15:53 PM UTC
Updated:: Feb 06 2026 11:32:51 AM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates