Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.1.8
Affects Version/s: None
Component/s: Sharding
Labels:
- ShardedTxn:RouterSupport

Backwards Compatibility:
Fully Compatible
Sprint:
Sharding 2019-01-28, Sharding 2019-02-11
Linked BF Score:
17
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When mongos encounters a "retryable" error during a transaction (e.g. a snapshot error on the first client statement), it removes the newly added participants from the participant list and retries the request. It relies on the shards implicitly aborting the transactions started for the first attempt before they service the new one.

If the operation on the router is killed (e.g. by killOp) after it clears the pending participants but before it re-targets, the router will not know to send abortTransaction to the shards targeted by the first attempt, which may leave those transactions open. To handle this and to simplify the contract around router retries, the router should instead send abortTransaction to every shard it removes from the participant list, and wait for all responses, before retrying. The ability for a shard to start a new transaction at the same transaction number as an in-progress one should also be removed.
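The proposed contract can be illustrated with a small, self-contained Python model. This is only a sketch, not the mongos implementation: `FakeShard`, `Router`, `RetryableError`, and every method name here are hypothetical, and the synchronous calls stand in for "send abortTransaction and wait for all responses".

```python
from dataclasses import dataclass, field


class RetryableError(Exception):
    """Hypothetical stand-in for a retryable error such as a snapshot error."""


@dataclass
class FakeShard:
    """Hypothetical in-memory shard that records the commands it receives."""
    name: str
    open_txns: set = field(default_factory=set)
    log: list = field(default_factory=list)

    def start_txn(self, txn_number: int) -> None:
        # Proposed shard-side rule: never restart a transaction at a number
        # that is still in progress (no implicit abort).
        if txn_number in self.open_txns:
            raise RuntimeError(f"txn {txn_number} already open on {self.name}")
        self.open_txns.add(txn_number)
        self.log.append(("start", txn_number))

    def abort_txn(self, txn_number: int) -> None:
        self.open_txns.discard(txn_number)
        self.log.append(("abortTransaction", txn_number))


class Router:
    """Hypothetical router that tracks participants for one transaction."""

    def __init__(self, txn_number: int) -> None:
        self.txn_number = txn_number
        self.participants = []  # shards kept from earlier statements
        self.pending = []       # shards added by the current statement

    def clear_pending_participants(self) -> None:
        # Explicitly abort on every shard being removed. Each call returns
        # only after the shard has processed it, modelling the wait.
        for shard in self.pending:
            shard.abort_txn(self.txn_number)
        self.pending = []

    def run_statement(self, targets, attempt_fn, max_attempts=3) -> int:
        for attempt in range(max_attempts):
            try:
                for shard in targets:
                    if shard not in self.participants:
                        shard.start_txn(self.txn_number)
                        self.pending.append(shard)
                attempt_fn(attempt)
                self.participants.extend(self.pending)
                self.pending = []
                return attempt
            except RetryableError:
                self.clear_pending_participants()
        raise RuntimeError("retries exhausted")


def fail_first_attempt(attempt: int) -> None:
    # Simulates a retryable error on the first attempt only.
    if attempt == 0:
        raise RetryableError("snapshot error")


shards = [FakeShard("shardA"), FakeShard("shardB")]
router = Router(txn_number=5)
attempt_used = router.run_statement(shards, fail_first_attempt)
```

In this model each shard sees a start, an explicit abortTransaction, and then a second start at the same transaction number, so the retry never depends on an implicit abort. Because the abort happens as part of clearing the pending participants, there is no window in which the router has forgotten a shard that still holds an open transaction.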

Assignee: Jack Mulrow
Reporter: Jack Mulrow
Participants: Githook User, Jack Mulrow
Votes: 0
Watchers: 3

Created: Jan 22 2019 06:13:14 PM UTC
Updated: Oct 29 2023 10:24:55 PM UTC
Resolved: Jan 29 2019 10:34:54 PM UTC
Confidence Status Last Update: 24/Jan/19 9:22 PM
