Capture Detailed File Copy Based Initial Sync Metrics


    • Type: Improvement
    • Resolution: Duplicate
    • Priority: Major - P3
    • Affects Version/s: 8.2.0-rc0
    • Component/s: Replication
    • Replication
    • 3
    • TBD

      There's very little visibility today into the aggregate behavior and operation of file copy based initial syncs. Across a large number of clusters, it's very difficult to answer:

      • What is the normalized throughput of a file copy based initial sync?
      • What is the success rate of file copy based initial syncs?
      • What is the typical duration of a file copy based initial sync?
      • In what phase of file copy based initial syncs do most failures occur?
      • How many file copy based initial syncs are there on a given day?

      To assist in answering these questions and others we should capture relevant metrics. We can take inspiration from resharding metrics.

      When complete, it should be possible to build a funnel chart/diagram detailing clusters' progress through the end-to-end file copy based initial sync process, along with charts detailing the performance of file copy based initial syncs.
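
      As a rough illustration of how the questions above could be answered once per-attempt data exists, here is a minimal sketch. Everything in it is an assumption made for illustration: the `SyncAttempt` record, its field names, the phase name "cloning", and the choice of bytes per second as the "normalized" throughput are hypothetical and do not reflect any existing server metric or schema.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date
from statistics import median
from typing import Optional


@dataclass
class SyncAttempt:
    """One file copy based initial sync attempt (hypothetical record shape)."""
    day: date                    # calendar day the attempt started
    duration_secs: float         # wall-clock duration of the attempt
    bytes_copied: int            # total bytes copied during the attempt
    failed_phase: Optional[str]  # None on success, else the phase that failed


def summarize(attempts):
    """Aggregate per-attempt records into fleet-level answers."""
    successes = [a for a in attempts if a.failed_phase is None]
    total_secs = sum(a.duration_secs for a in successes)
    total_bytes = sum(a.bytes_copied for a in successes)
    return {
        # Fraction of attempts that completed without failing in any phase.
        "success_rate": len(successes) / len(attempts) if attempts else None,
        # Median is used as the "typical" duration (an assumption).
        "median_duration_secs": (
            median(a.duration_secs for a in successes) if successes else None
        ),
        # Throughput normalized as bytes per second over successful attempts.
        "throughput_bytes_per_sec": total_bytes / total_secs if total_secs else None,
        # Failure counts by phase, the raw input for a funnel chart.
        "failures_by_phase": Counter(
            a.failed_phase for a in attempts if a.failed_phase is not None
        ),
        # Number of attempts started on each day.
        "attempts_per_day": Counter(a.day for a in attempts),
    }


# Made-up sample data, for illustration only.
attempts = [
    SyncAttempt(date(2025, 1, 1), 100.0, 1000, None),
    SyncAttempt(date(2025, 1, 1), 300.0, 3000, None),
    SyncAttempt(date(2025, 1, 2), 50.0, 200, "cloning"),
]
summary = summarize(attempts)
```

      A funnel view would additionally need the ordered list of phases, so that the number of attempts reaching each phase can be derived from where failures occurred.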

              Assignee:
              Unassigned
              Reporter:
              Matt Panton
              Votes:
              0
              Watchers:
              5

                Created:
                Updated:
                Resolved: