Create Metric to Track Corrupted Pages Read from PageServer


    • Storage Engines - Server Integration
    • SESI - 2026-01-13

      We need a metric that detects and counts cases where a WiredTiger (WT) page read from the PageServer is corrupted.

      Page corruption can occur at various points in the disaggregation architecture:

      1. Primary sending corrupted pages to the LogServer:
        Pages can be corrupted before being sent to the LogServer.
      2. Primary encryption errors:
        Pages may be encrypted incorrectly by the primary node due to issues with the Key Encryption Key (KEK) or Data Encryption Key (DEK).
      3. PageMaterializer corruption:
        Pages may be inadvertently corrupted while being processed by the PageMaterializer.
      4. PageServer corruption:
        Pages may become corrupted within the PageServer itself.

      Implementing this metric will let us monitor for corrupted pages and identify them promptly, minimizing their operational impact.
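A minimal sketch of what such a counter could look like, assuming corruption is detected by comparing a checksum of the page payload against an expected value on the read path. The ticket does not specify the detection mechanism, so the names `PageReadMetrics`, `corrupted_pages_read`, and `record_page_read` are hypothetical, and CRC32 is only a stand-in for whatever integrity check the real read path uses.

```python
import zlib
from dataclasses import dataclass


@dataclass
class PageReadMetrics:
    """Hypothetical counters; names are illustrative, not an existing API."""
    pages_read: int = 0
    corrupted_pages_read: int = 0


def checksum(payload: bytes) -> int:
    # Assumption: CRC32 stands in for the real integrity check.
    return zlib.crc32(payload)


def record_page_read(metrics: PageReadMetrics, payload: bytes,
                     expected_checksum: int) -> bool:
    """Count one page read; return True if the page passed verification."""
    metrics.pages_read += 1
    if checksum(payload) != expected_checksum:
        metrics.corrupted_pages_read += 1
        return False
    return True


metrics = PageReadMetrics()
good_page = b"wt-page-contents"
good_sum = checksum(good_page)

record_page_read(metrics, good_page, good_sum)            # intact page
record_page_read(metrics, b"wt-page-c0ntents", good_sum)  # one altered byte
```

A counter like this only reports that a page failed verification at read time. It does not identify which of the four stages above introduced the corruption, and it would miss a page that was already corrupted before its expected checksum was computed.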

            Assignee:
            Unassigned
            Reporter:
            Ernesto Rodriguez Reina
