Uploaded image for project: 'WiredTiger'
  1. WiredTiger
  2. WT-8163

Consider more eviction scenarios to give up checkpoint-cleanup

    • Type: Icon: Improvement Improvement
    • Resolution: Fixed
    • Priority: Icon: Major - P3 Major - P3
    • WT10.0.1, 5.2.0, 5.1.0-rc2, 5.0.5
    • Affects Version/s: None
    • Component/s: None
    • None
    • 8
    • Storage - Ra 2021-10-18

      WT-7787 made a change to give up checkpoint cleanup if the cache gets into aggressive mode. The idea was to postpone cleanup to the next checkpoint, as eviction has begun to struggle.

      In a customer case, we saw a similar issue that we tried to resolve in WT-7787, but the cache was not set to aggressive because eviction system thinks there is not much to evict and it is doing a good job of evicting what it can find. This when the application was actually stalled because dirty was continuously 20%+.

      I think we should refine the eviction condition the changes from WT-7787 are based on, as looking for an aggressive set might not be sufficient.

      This ticket should relook at the customer issue, understand it deeper and study if there is more to it than I have mentioned, even though checkpoint cleanup and touching 20% dirty seem the root cause to me. The outcome should be understanding what eviction was up to, why it is unable to bring down dirty content, and if we can refine changes from WT-7787 to mitigate the issue.

        1. orig_code.png
          orig_code.png
          296 kB
        2. more_checks.png
          more_checks.png
          293 kB
        3. haris_suggestion.png
          haris_suggestion.png
          300 kB
        4. cache-20-dirty-cleanup.png
          cache-20-dirty-cleanup.png
          367 kB

            Assignee:
            etienne.petrel@mongodb.com Etienne Petrel
            Reporter:
            sulabh.mahajan@mongodb.com Sulabh Mahajan
            Haribabu Kommi, Sulabh Mahajan
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

              Created:
              Updated:
              Resolved: