Uploaded image for project: 'WiredTiger'
  1. WiredTiger
  2. WT-14550

fsync taking a meaningful proportion of checkpoint time

    • Type: Icon: Task Task
    • Resolution: Unresolved
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • Storage Engines
    • None
    • 5
    • 0

      SLS-1867 added an option to disable fsync for disaggregated storage, helpfully configured as the lose_all_my_data option.

      There wasn't a follow-on change to actually use this option in MongoDB, since there were bigger fish to fry during the performance sprint.

      However, the original observation that fsync is taking a meaningful proportion of checkpoint time (in disagg) still holds true. We should understand why this is, and stop doing that. It's possible that checkpoint is doing unnecessary work besides the fsync, that work is also in scope.

      A successful outcome here would be to ensure that local IO for checkpoints in disaggregated storage doesn't generate meaningful overhead. (It's OK to have some content written locally, but that mustn't be slow.)

            Assignee:
            backlog-server-storage-engines [DO NOT USE] Backlog - Storage Engines Team
            Reporter:
            will.korteland@mongodb.com Will Korteland
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: