Investigate long running checkpoints on selected clusters using FTDC analysis

XMLWordPrintableJSON

    • Type: Task
    • Resolution: Unresolved
    • Priority: Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • None

      Issue Summary

      Long running checkpoints have been observed on selected clusters. The goal is to analyze FTDC data from these clusters to determine root cause and next steps.

      Context

      • FTDC data needs to be pulled from the affected clusters.
      • The analysis should answer:
        • Do we have enough information to RCA the long running checkpoints?
        • If not, what additional data is required?
        • If the issue is known, what backports are missing?
        • If the issue is new, a follow-up ticket should be created to address it.

      Proposed Solution

      • Pull FTDC data from the selected clusters.
      • Perform analysis to identify the root cause of long running checkpoints.
      • Document whether the available data is sufficient for RCA.
      • If insufficient, list additional data requirements.
      • If the issue is known, identify and document missing backports.
      • If the issue is new, create a ticket to address the new problem.

      Original Slack thread: Slack Thread
      This ticket was generated by AI from a Slack thread.

            Assignee:
            Mariam Mojid
            Reporter:
            Memento Slack Bot
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

              Created:
              Updated: