Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Done
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Tools
Labels:
None

Assigned Teams:

Storage Engines, Storage Engines - Persistence
Total Hours with Assigned Team:
0.001
Epic Link:
Checkpoint size for disagg
Sprint:
SE Persistence backlog
Story Points:
None

Issue Summary

We want to create observability around database size through Grafana, surfacing the clusters that consume the most storage in the disaggregated (DSC) world.

Context

Part of the WT-16385 effort to define and expose checkpoint/database size for disagg.
The idea is to print the top N clusters with the largest database size and/or the ones exceeding a specific value.
In Grafana, the metric to use is mongodb_wiredTiger_disagg_database_size.

Proposed Solution

Build a Grafana panel/dashboard using one of the following queries:

# Sort all clusters
sort_desc(
  sum(
    mongodb_wiredTiger_disagg_database_size{rs_state="1", xgen_environment=~"dev", xgen_provider=~"aws"}
  ) by (xgen_environment, xgen_provider, cl_name, rs_name)
)

# Select the top N
topk(
  20,
  sort_desc(
    sum(
      mongodb_wiredTiger_disagg_database_size{rs_state="1", xgen_environment=~"dev", xgen_provider=~"aws"}
    ) by (xgen_environment, xgen_provider, cl_name, rs_name)
  )
)

# Hard cutoff
sum(
  mongodb_wiredTiger_disagg_database_size{rs_state="1", xgen_environment=~"dev", xgen_provider=~"aws"}
) by (xgen_environment, xgen_provider, cl_name, rs_name)
> 1000000000000000000

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

image-2026-06-03-09-13-04-651.png
71 kB
Jun 03 2026 04:13:05 PM UTC
image-2026-06-03-09-28-15-174.png
163 kB
Jun 03 2026 04:28:16 PM UTC
image-2026-06-03-09-28-53-573.png
216 kB
Jun 03 2026 04:28:54 PM UTC

Assignee:: Vamsi Boyapati
Reporter:: Etienne Petrel
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Jun 02 2026 09:42:13 PM UTC
Updated:: Jun 09 2026 08:53:58 PM UTC
Resolved:: Jun 03 2026 05:55:06 AM UTC

Details

Description

Issue Summary

Context

Proposed Solution

Attachments

Attachments

Activity

People

Dates