Core Server / SERVER-56763

Validate collection epoch when not holding a DB lock for $merge


    Details

    • Type: Task
    • Status: Closed
    • Priority: Major - P3
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.0.3, 5.1.0-rc0
    • Component/s: None
    • Labels:
      None
    • Backwards Compatibility:
      Fully Compatible
    • Backport Requested:
      v5.0
    • Sprint:
      Query Optimization 2021-06-14, Query Optimization 2021-06-28, Query Optimization 2021-07-12, Query Optimization 2021-07-26
    • Linked BF Score:
      43

      Description

      While fixing SERVER-54507 we discussed a possible future optimization in preparing to execute $merge. The idea is to defer the targetCollectionVersion epoch check until the DB lock is no longer held, or to call

      ShardServerProcessInterface::checkRoutingInfoEpochOrThrow()

      outside of a DB lock, right before the query executes on the leaf nodes of the merge topology.

      Why would this be better? 

      The short answer:
      Because this shard is serving as a router, not as a shard, so in theory the epoch check at this point doesn't matter anyway.
      The long answer:
      Because the current check only ensures that this MongoD (acting as a router) knows at least as much as the router that sent the merge command. In the grand scheme of things, however, both can be wrong and agree on the same wrong thing. It is more correct to have the leaf nodes of the merge topology perform the epoch check, or to call

      ShardServerProcessInterface::checkRoutingInfoEpochOrThrow()

      which avoids the case where the leaf nodes that perform the data reads know about a dropped collection but the mongos does not at the time it sends the targetCollectionVersion to the mongod acting as the router.
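      The shape of such a check can be sketched as below. This is a simplified stand-in, not the server's actual implementation: the epoch type and the error are hypothetical placeholders for MongoDB's OID-based collection epoch and its StaleEpoch error, which would normally trigger a routing-table refresh and retry.

      ```cpp
      #include <stdexcept>
      #include <string>

      // Hypothetical stand-in for the OID-based collection epoch.
      using CollectionEpoch = std::string;

      // Compares the epoch this node has cached against the epoch the router
      // attached to the $merge command, and throws if they diverge. The point
      // of the proposed change is to run this *without* holding the DB lock,
      // immediately before the query executes on the leaf nodes.
      void checkEpochOrThrow(const CollectionEpoch& cachedEpoch,
                             const CollectionEpoch& routerEpoch) {
          if (cachedEpoch != routerEpoch) {
              // In the server this would be a StaleEpoch error, prompting the
              // caller to refresh its routing info and retry.
              throw std::runtime_error(
                  "StaleEpoch: collection was dropped or recreated");
          }
      }
      ```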

      Note that there could still be a pathological case where the merge topology has two leaf nodes and one is reached much earlier than the other: the first could process petabytes of data even though the collection has already been dropped on the second leaf. The only theoretical way around this is probably to open cursors on all shards that will participate in the merge plan, but that may well be infeasible.


              People

              Assignee:
              nicholas.zolnierz Nicholas Zolnierz
              Reporter:
              eric.cox Eric Cox
              Votes:
              0
              Watchers:
              6
