-
Type:
Improvement
-
Resolution: Fixed
-
Priority:
Major - P3
-
Affects Version/s: None
-
Component/s: Checkpoints
-
Storage Engines
-
2023-06-27 Lord of the Sprints, 2023-07-11 WiredTractor, 2023-07-25 Absolute unit, StorEng - 2023-08-08, ASeasonTooMany-2023-08-22
-
8
-
v7.0, v6.0
-
Needed
-
Cloud, Docs, DBX: DevTools (Compass, Shell, VS Code Ext), Triage and Release, TAR (DB Tools, Mongomirror, Mongosync)
-
Checkpointing is one of the most user-visibly disruptive maintenance operations WiredTiger does. Checkpoints sometimes take a long time, and it isn't clear what the checkpoint is doing with that time.
We should add metrics and/or log lines that give better insight.
An example from a customer shows checkpoints taking between 15 and 20 minutes to complete, spending 4 minutes of that time writing content (2 at the start, 2 at the end), with no meaningful indication of what happens for the remaining 11 to 16 minutes. No meaningful time is spent flushing checkpoint content to disk.
A starting point for this is probably building or finding a workload that takes a long time to create a new checkpoint.
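To make the gap in the customer example concrete, here is a minimal sketch of the accounting that per-phase metrics would allow. It is not WiredTiger code: the `CheckpointTimeline` class, its fields, and the phase names are hypothetical, and the only real inputs are the figures quoted above (15 to 20 minutes total, 2 + 2 minutes writing).

```python
from dataclasses import dataclass, field


@dataclass
class CheckpointTimeline:
    """Hypothetical record of one checkpoint: total wall time plus named phase timers."""

    total_secs: float
    phase_secs: dict = field(default_factory=dict)

    def unaccounted_secs(self) -> float:
        # Time that no instrumented phase covers; this is the part
        # the ticket says has "no meaningful indication".
        return self.total_secs - sum(self.phase_secs.values())

    def report(self) -> str:
        lines = [f"checkpoint total: {self.total_secs / 60:.1f} min"]
        for name, secs in self.phase_secs.items():
            lines.append(f"  {name}: {secs / 60:.1f} min")
        lines.append(f"  unaccounted: {self.unaccounted_secs() / 60:.1f} min")
        return "\n".join(lines)


# Figures from the customer example; phase names are illustrative only.
write_phases = {"write (start)": 2 * 60, "write (end)": 2 * 60}
fast = CheckpointTimeline(15 * 60, dict(write_phases))
slow = CheckpointTimeline(20 * 60, dict(write_phases))

print(fast.report())
print(slow.report())
```

With only the two write phases instrumented, the unaccounted share is 11 minutes for the 15-minute checkpoint and 16 minutes for the 20-minute one. Each additional phase timer would shrink that remainder, which is the kind of insight the new metrics or log lines are meant to provide.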
- causes
-
WT-12363 Concurrent checkpoint can produce misleading stats
-
- Closed
-
- is depended on by
-
COMPASS-9318 Investigate changes in WT-11171: Add metrics that give better insight into what checkpoint is doing
-
- Needs Triage
-
-
TOOLS-3832 Investigate changes in WT-11171: Add metrics that give better insight into what checkpoint is doing
-
- Closed
-