Capture Detailed File Copy Based Initial Sync Metrics

XMLWordPrintableJSON

    • Type: Improvement
    • Resolution: Duplicate
    • Priority: Major - P3
    • None
    • Affects Version/s: 8.2.0-rc0
    • Component/s: Replication
    • Replication
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      There's very little visibility into the aggregate behavior and operation of file copy based initial syncs today it's very difficult to answer across a large number of clusters:

      • What is the normalized throughput of a file copy based initial sync
      • What is the success rate of file copy based initial syncs 
      • What is the typical duration of an file copy based initial sync
      • What phase of file copy based initial syncs do most failures occur
      • How many file copy based initial syncs are there on a given day

      To assist in answering these questions and others we should capture relevant metrics. We can take inspiration from resharding metrics.

      When complete it should be possible to build a funnel chart/diagram detailing clusters progress through the end-to-end file copy based initial sync process and charts detailing the performance of file copy based initial syncs.  

            Assignee:
            Unassigned
            Reporter:
            Matt Panton
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: