[COMPASS-6998] Investigate changes in SERVER-76001: Add serverStatus metrics for PeriodicThreadToAbortExpiredTransactions Created: 04/Jul/23  Updated: 29/Jan/24  Resolved: 29/Jan/24

Status: Closed
Project: Compass
Component/s: None
Affects Version/s: None
Fix Version/s: No version

Type: Investigation Priority: Major - P3
Reporter: Backlog - Core Eng Program Management Team Assignee: Unassigned
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Depends
depends on SERVER-76001 Add serverStatus metrics for Periodic... Closed
Epic Link: COMPASS-6403
Documentation Changes: Not Needed

 Description   
Original Downstream Change Summary

New serverStatus field:
"metrics.abortExpiredTransactions.passes"

A 'pass' is scheduled periodically, and aborts transactions older than 'transactionLifetimeLimitSeconds'. "passes" is incremented after a successful pass.

For Triage: If the metric stops incrementing, the thread could be stuck trying to abort a transaction.

Description of Linked Ticket

This thread is imperative to ensure forward progress of the system. Since we've seen multiple hangs where the thread gets stuck, we should add serverStatus metrics which track the number of of transactions killed by the thread.

Additionally, we should consider adding a log line to indicate which session(s) the thread is trying to abort.



 Comments   
Comment by Anna Henningsen [ 29/Jan/24 ]

No impact

Comment by PM Bot [ 04/Jul/23 ]

Fix Version updated for upstream SERVER-76001:
7.1.0-rc0

Generated at Wed Feb 07 22:44:59 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.