Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.2.0-rc0, 8.1.2
Affects Version/s: None
Component/s: None
Labels:

Assigned Teams:

Replication
Backwards Compatibility:
Fully Compatible
Backport Requested:

v8.1, v8.0, v7.0, v6.0
Sprint:
Repl 2025-03-31, Repl 2025-04-14, Repl 2025-05-12, Repl 2025-05-26, Repl 2025-06-09
Linked BF Score:
200
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Right now we approximate this mutex contention by looking at the ping time (since that ends up taking the replication coordinator mutex), we should add a metric that can track this directly. We should be able to time how long it takes to take the mutex. I'm not sure if this can cause enough increased load on the mutex to make the situation worse for customers.

At the very least we could add a command that just takes the mutex and releases it if we wanted to time how long that command took. We don't have to call the command as part of collecting FTDC, but we could manually call it for clusters that we suspect mutex contention.

Assignee:: Evelyn Wu
Reporter:: Samyukta Lanka
Participants:: Evelyn Wu, Githook User, Samyukta Lanka
Votes:: 0 Vote for this issue
Watchers:: 12 Start watching this issue

Created:: Nov 14 2024 09:23:23 PM UTC
Updated:: Jun 17 2025 11:33:24 PM UTC
Resolved:: May 29 2025 09:26:34 PM UTC

Details

Description

Attachments

Activity

People

Dates