alexander.gorrod, something occurred to me while haribabu.kommi and I were working on the new sync code: the changes made to support checkpoint's garbage collection of the history store (identifying WT_REFs by type in all cases and reworking the WT_REF locking) are applicable to file compaction as well.
Currently, compaction has a big weakness: it walks the tree and, for every leaf page in the file, checks whether the page should be rewritten. If the page should be rewritten and isn't in cache, it's read into the cache, marked dirty and then written by normal eviction. This has two bad consequences: first, compaction spikes cache use and makes eviction less effective; second, we waste a lot of work reading a page, converting it to its in-memory representation, reconciling it and finally writing it (for example, checksumming, encryption and compression all happen on both the read and write paths).
Using the same techniques as the new checkpoint code, we could change things so pages that aren't in memory are rewritten in a single call to the block manager, avoiding all of that bad behavior, and probably improving our compression story, since we won't have to wait for eviction to write the page.
Additionally, the compaction code quits if eviction is struggling. That's a pain point that comes up every now and then: a MongoDB installation can't get compaction working because eviction is under too much stress, so compaction starts up and then fails without making progress.
I took a look at this today, and think this approach will work. I'd estimate about two days of work, plus possible testing fallout, of course.