Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Fixed
Priority: Major - P3
Fix Version/s: WT11.3.0, 8.1.0-rc0, 8.0.5, 7.0.18
Affects Version/s: None
Component/s: Checkpoints, Compaction
Labels:
- code-quality

Assigned Teams:

Storage Engines
Sprint:
2024-08-06 - Withholding Tax
Story Points:
8

Backport Requested:

v8.0, v7.0, v6.0, v5.0

Background

Compaction and checkpoint may conflict, whenever this happens, one may see:

                    WT_ERR_MSG(session, EBUSY,
                      "Compaction halted at data handle %s by eviction pressure. Returning EBUSY.",
                      session->op_handle[i]->name);

This happens when compaction and checkpoint are trying to get the same lock at the same time, see here:

    /*
     * We could corrupt a checkpoint if we moved a block that's part of the checkpoint, that is, if
     * we race with checkpoint's review of the tree. Get the tree's flush lock which blocks threads
     * writing pages for checkpoints, and hold it long enough to review a single internal page. Quit
     * working the file if checkpoint is holding the lock, checkpoint holds the lock for relatively
     * long periods.
     */
    WT_RET(__wt_spin_trylock(session, &S2BT(session)->flush_lock));

Problem

This conflict has been evident in multiple help tickets (linked below). In these cases, compact both took a long time to complete and recovered little space relative to the space available according to the bytes available for reuse statistic. This could be because the checkpoint conflict causes the compact tree walk to exit early before it's able to reach the pages it needs to rewrite from the end of the file. Then, compact will restart its walk each time and repeatedly read and evict the same internal pages.

Acceptance Criteria

Create a test to reproduce the case where checkpoint contends with compact due to the flush_lock and compact exits its walk early.
- Verify if this causes compact to fail to reclaim space.
Ensure compact still reclaims space in this scenario

Consider if compaction could block checkpoint if it does not let checkpoint get the flush_lock. This could be proven by doing the following:

diff --git a/src/btree/bt_compact.c b/src/btree/bt_compact.c
index f0e8bae52..db41a81c2 100644
--- a/src/btree/bt_compact.c
+++ b/src/btree/bt_compact.c
@@ -254,6 +254,7 @@ __compact_walk_internal(WT_SESSION_IMPL *session, WT_REF *parent)
      */
     overall_progress = false;
     WT_INTL_FOREACH_BEGIN (session, parent->page, ref) {
+        // Sleep and check that checkpoint is waiting.
         if (F_ISSET(ref, WT_REF_FLAG_LEAF)) {
             WT_ERR(__compact_page(session, ref, &skipp));
             if (!skipp)

Fix how compact walk handles EBUSY from checkpoint flush_lock

Background

Problem

Acceptance Criteria

Suggested Solutions

Details

Description

Background

Problem

Acceptance Criteria

Suggested Solutions

Attachments

Attachments

Activity

People

Dates