Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.3.0-rc0, 8.2.6, 8.0.21, 7.0.35
Affects Version/s: 6.0.0, 7.0.0, 8.0.0, 8.3.0-rc0, 8.2.0
Component/s: Catalog, TTL
Labels:
None

Assigned Teams:

Catalog and Routing
Backwards Compatibility:
Fully Compatible
Backport Requested:

v8.2, v8.0, v7.0
Sprint:
CAR Team 2026-02-16
Story Points:
2
CAR Domain/s:

🟦 Shard Catalog

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When the TTL Monitor encounters a StaleConfig error indicating that the sharding metadata for a collection needs to be recovered, it spawns an async thread to execute that recovery and then moves on to the next collection. On clusters with many collections with TTL indexes, this can spawn a large number of threads, particularly during startup, where the sharding metadata is unknown for all collections. This can cause resource exhaustion due to the number of threads/memory, an also thundering heard effects on the configsvr handling the metadata refreshes. We should limit the amount of threads that the TTL Monitor can start for sharding metadata recovery.

is caused by

SERVER-63245 TTL Monitor thread doesn't recover the shard version

Closed

Assignee:: Jordi Serra Torrens
Reporter:: Jordi Serra Torrens
Participants:: Githook User, Jordi Serra Torrens
Votes:: 0 Vote for this issue
Watchers:: 13 Start watching this issue

Created:: Jan 29 2026 09:12:54 AM UTC
Updated:: Jun 16 2026 02:12:01 PM UTC
Resolved:: Feb 03 2026 04:07:23 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates

PagerDuty