Investigate long running checkpoints on selected clusters using FTDC analysis

XMLWordPrintableJSON

    • Type: Sub-task
    • Resolution: Done
    • Priority: Major - P3
    • None
    • Affects Version/s: None
    • Component/s: Checkpoints
    • None
    • Storage Engines - Persistence
    • 1,126.543
    • SE Persistence backlog
    • None

      Issue Summary

      Long running checkpoints have been observed on selected clusters. The goal is to analyze FTDC data from these clusters to determine root cause and next steps.

      Context

      • FTDC data needs to be pulled from the affected clusters.
      • The analysis should answer:
        • Do we have enough information to RCA the long running checkpoints?
        • If not, what additional data is required?
        • If the issue is known, what backports are missing?
        • If the issue is new, a follow-up ticket should be created to address it.

      Proposed Solution

      • Pull FTDC data from the selected clusters.
      • Perform analysis to identify the root cause of long running checkpoints.
      • Document whether the available data is sufficient for RCA.
      • If insufficient, list additional data requirements.
      • If the issue is known, identify and document missing backports.
      • If the issue is new, create a ticket to address the new problem.

      Original Slack thread: Slack Thread
      This ticket was generated by AI from a Slack thread.

            Assignee:
            Etienne Petrel
            Reporter:
            Memento Slack Bot
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: