Type: Improvement
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.1.0-rc0
Affects Version/s: None
Component/s: None
Labels: None
Assigned Teams: Catalog and Routing
Backwards Compatibility: Fully Compatible
Sprint: CAR Team 2024-10-28, CAR Team 2024-11-11, CAR Team 2024-11-25
Linked BF Score: 200
Story Points: 3
Confidence Status: None
Work Order: 3
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

checkMetadataConsistency acquires a MODE_S lock on the database to guarantee some catalog stability during its checks. This strong lock interacts poorly with long-running intent locks, such as those held by resharding. For instance:

1. Resharding acquires an IX lock on the database.
2. checkMetadataConsistency enqueues a MODE_S lock request.
3. Any write that then tries to acquire another IX lock blocks behind the queued MODE_S request until both resharding and checkMetadataConsistency complete, or until the MODE_S request times out (5 minutes by default).
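
The queuing behaviour in the steps above can be illustrated with a toy model. This is only a sketch: `ToyDbLock` and its FIFO grant rule are hypothetical simplifications written for this ticket, not the server's actual lock manager, and only the IX and S modes are modelled.

```python
from collections import deque

# Compatibility between the two modes in this example:
# IX is compatible with IX, S with S, and IX conflicts with S.
COMPATIBLE = {
    ("IX", "IX"): True,
    ("IX", "S"): False,
    ("S", "IX"): False,
    ("S", "S"): True,
}


class ToyDbLock:
    """Hypothetical FIFO database lock (illustration only)."""

    def __init__(self):
        self.granted = []       # (owner, mode) pairs currently holding the lock
        self.pending = deque()  # (owner, mode) pairs waiting, in arrival order

    def request(self, owner, mode):
        """Grant only if compatible with every holder and nobody is queued ahead."""
        compatible = all(COMPATIBLE[(held, mode)] for _, held in self.granted)
        if compatible and not self.pending:
            self.granted.append((owner, mode))
            return "granted"
        self.pending.append((owner, mode))
        return "queued"


lock = ToyDbLock()
print(lock.request("resharding", "IX"))               # granted
print(lock.request("checkMetadataConsistency", "S"))  # queued: conflicts with IX
print(lock.request("write", "IX"))                    # queued: waits behind the S request
```

The third request is compatible with the IX lock that resharding holds, yet it still waits because the queued S request is ahead of it.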

This is a potential problem in production and also in testing, where we run checkMetadataConsistency in the background and some suites also run background collection migrations (moveCollection/resharding).

One idea is to have a try-lock API with backoff, so that the MODE_S lock is not enqueued right away. If the operation would starve, we can either fail it or eventually enqueue it as we do today.
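
A minimal sketch of that idea follows, under stated assumptions. The function `acquire_with_backoff`, its parameters, and the two callables are hypothetical names chosen for illustration; they do not describe an existing server API or the change that was eventually merged.

```python
import time


def acquire_with_backoff(try_lock, enqueue_lock, max_attempts=5,
                         initial_delay=0.01, max_delay=1.0,
                         fail_on_starvation=False, sleep=time.sleep):
    """Attempt a non-queuing acquisition several times before falling back.

    try_lock:     callable returning True if the lock was taken without
                  enqueuing a request (assumed primitive).
    enqueue_lock: callable performing the blocking, queued acquisition
                  that is used today (assumed primitive).
    """
    delay = initial_delay
    for attempt in range(max_attempts):
        if try_lock():
            return "try-lock"
        if attempt < max_attempts - 1:
            # Back off exponentially so conflicting IX holders can finish.
            sleep(delay)
            delay = min(delay * 2, max_delay)
    if fail_on_starvation:
        # Option 1: give up instead of blocking other writers.
        raise TimeoutError("MODE_S lock not acquired without queuing")
    # Option 2: fall back to enqueuing the request, as is done today.
    enqueue_lock()
    return "enqueued"
```

While the try-lock attempts are failing, no MODE_S request sits in the queue, so new IX requests are not blocked behind it. The cost is that checkMetadataConsistency may starve under a steady write load, which is why a fallback (fail or enqueue) is needed.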

Assignee: Wolfee Farkas
Reporter: Daniel Gomez Ferro
Participants: Daniel Gomez Ferro, Githook User, Wolfee Farkas
Votes: 0
Watchers: 6

Created: Jul 08 2024 08:57:37 AM UTC
Updated: Nov 22 2024 11:06:42 AM UTC
Resolved: Nov 22 2024 11:06:41 AM UTC
Confidence Status Last Update: 15/Nov/24 10:32 AM
