Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Done
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Epic Link:
SPM-1407
Sprint:
Storage - Ra 2021-05-17
Story Points:
3

By far the biggest time consumer during the gathering of handles is the time spent to parse the checkpoint strings from the metadata into the ckpt array via wt_meta_ckptlist_get. By far the most common scenario is that the checkpoint thread loads the most recent checkpoint WiredTigerCheckpointN into ckpt[0] and then allocates one for the in-progress checkpoint WiredTigerCheckpointN+1 at ckpt[1]. Checkpoint then fills in and generates the metadata string for ckpt[1]. Then on the next checkpoint it reads in the string and parses WiredTigerCheckpointN+1 into ckpt[0] and allocates one at ckpt[1] for WiredTigerCheckpointN+2. And so on. If there are no metadata changes for this dhandle other than this pattern, we could store the most recent ckpt structures in the dhandle between checkpoints so that on the next checkpoint we don’t have to reread and reparse the strings.

If anything changes the metadata at all then those stored ckpt structures are invalidated and we would continue with the slow path we have today. But if the pattern holds, then we may have a very rapid gathering of handles. Code paths that would invalidate these stored structures include (but are not limited to) the WT_SESSION::alter API, using named checkpoints in any way, any code path that checkpoints a file individually.

The also-common path of taking checkpoints while a backup cursor is opened should also be part of this work. In that case we would simply have an extra checkpoint stored: the backup checkpoint that we retain until the cursor is closed. This change should work with backups.

I believe, but have not fully thought through, that we only need to keep the most recent checkpoint (i.e. the one being generated this time) for comparison or basis for the next checkpoint. Maybe we need to keep them all stored (at most 2 old ones plus the current one we’re creating) so that we can do the deletion checks, etc.

However avoiding the string parsing will be the biggest win when you have thousands of files to walk through.

is related to

WT-7381 Cache btree's ckptlist between checkpoints

Closed

WT-7421 Handle backup checkpoints when maintaining most recent checkpoints

Closed

WT-7422 Find and account for all code paths that invalidate cached checkpoint information

Closed

related to

WT-7335 Add statistics and debug functions to get insight into dhandle usage

Closed

Assignee:: Sulabh Mahajan
Reporter:: Sulabh Mahajan
Votes:: 0 Vote for this issue
Watchers:: 5 Start watching this issue

Created:: Mar 11 2021 07:28:35 AM UTC
Updated:: May 11 2021 10:22:51 AM UTC
Resolved:: May 11 2021 10:22:51 AM UTC

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates