Use shared lock and atomic timestamp in LogicalSessionCacheImpl::vivify fast path


    • Product Performance

      Problem

      LogicalSessionCacheImpl::vivify runs once per request, invoked from ServiceEntryPointShardRole::handleRequest via initializeOperationSessionInfo. On a 128-thread YCSB 100% read workload it acquires an exclusive stdx::mutex on every call, even though the steady-state path is just an absl::raw_hash_set::find plus a lastUse timestamp update. In the pinned-baseline pprof, vivify accounts for 0.44% cumulative of total mongod CPU, and 52% of that is pthread_mutex_lock + pthread_mutex_unlock overhead: the mutex itself, not the actual cache lookup, is the dominant cost. With a single shared mutex across all worker threads, this is pure cache-line ping-pong on the lock word at ~88K finds/sec.

      Solution

      Replace stdx::mutex _mutex with RWMutex _mutex, wrap each cache record in a stable-address CachedSessionEntry containing the existing LogicalSessionRecord plus an AtomicWord<long long> lastUseMs shadow, and split vivify() into a shared-lock fast path (find plus an atomic timestamp update) and an exclusive-lock slow path (insert / refresh / reap / endSessions, semantically unchanged). Callers that expose record.lastUse (peekCached, _refresh, _reap, getStats, listIds) reconcile the atomic shadow back into the record via _syncLastUseToRecord under the exclusive lock before reading. The fast path no longer ping-pongs the mutex's cache line across cores, while the slow path preserves every existing invariant, including the _refresh() swap-then-back-swap protocol that prevents the SERVER-123432 reap race.
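      A minimal, self-contained sketch of the intended shape, not the actual implementation: std::shared_mutex and std::atomic<long long> stand in for RWMutex and AtomicWord<long long>, LogicalSessionId and LogicalSessionRecord are reduced to placeholders, and CachedSessionEntry, the fast/slow paths, and _syncLastUseToRecord follow the names proposed in this ticket.

      #include <atomic>
      #include <chrono>
      #include <memory>
      #include <mutex>
      #include <shared_mutex>
      #include <string>
      #include <unordered_map>

      // Placeholders standing in for LogicalSessionId / LogicalSessionRecord.
      using LogicalSessionId = std::string;
      struct LogicalSessionRecord {
          std::chrono::system_clock::time_point lastUse;
      };

      // Each record gets a stable-address entry so the fast path can stamp the
      // atomic shadow while holding only the shared lock.
      struct CachedSessionEntry {
          LogicalSessionRecord record;
          std::atomic<long long> lastUseMs{0};  // shadow of record.lastUse, ms since epoch
      };

      class LogicalSessionCacheSketch {
      public:
          // Fast path: shared lock + atomic timestamp stamp; falls back to the
          // exclusive-lock slow path only when the session is not yet cached.
          void vivify(const LogicalSessionId& lsid) {
              {
                  std::shared_lock lk(_mutex);
                  if (auto it = _cache.find(lsid); it != _cache.end()) {
                      it->second->lastUseMs.store(_nowMs(), std::memory_order_relaxed);
                      return;
                  }
              }
              _vivifySlowPath(lsid);
          }

          // Callers that read record.lastUse take the exclusive lock and reconcile
          // the atomic shadow back into the record first (throws if lsid is absent).
          LogicalSessionRecord peekCached(const LogicalSessionId& lsid) {
              std::unique_lock lk(_mutex);
              auto& entry = *_cache.at(lsid);
              _syncLastUseToRecord(entry);
              return entry.record;
          }

      private:
          void _vivifySlowPath(const LogicalSessionId& lsid) {
              std::unique_lock lk(_mutex);
              auto it = _cache.try_emplace(lsid, std::make_unique<CachedSessionEntry>()).first;
              it->second->lastUseMs.store(_nowMs(), std::memory_order_relaxed);
              _syncLastUseToRecord(*it->second);
              // Existing insert/refresh/reap bookkeeping would run here, unchanged.
          }

          // Must be called under the exclusive lock.
          void _syncLastUseToRecord(CachedSessionEntry& entry) {
              auto ms = entry.lastUseMs.load(std::memory_order_relaxed);
              entry.record.lastUse =
                  std::chrono::system_clock::time_point{std::chrono::milliseconds{ms}};
          }

          static long long _nowMs() {
              return std::chrono::duration_cast<std::chrono::milliseconds>(
                         std::chrono::system_clock::now().time_since_epoch())
                  .count();
          }

          std::shared_mutex _mutex;  // stand-in for RWMutex _mutex
          std::unordered_map<LogicalSessionId, std::unique_ptr<CachedSessionEntry>> _cache;
      };

      In this shape the relaxed atomic stores are sufficient because the plain record.lastUse field is only ever read after _syncLastUseToRecord has run under the exclusive lock, so no reader races with the fast-path stamps.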

            Assignee:
            Jawwad Asghar
            Reporter:
            Jawwad Asghar
            Votes:
            0
            Watchers:
            2
