Add metrics to provide visibility into whether validations are working as intended and what impact they have


    • Type: Task
    • Resolution: Duplicate
    • Priority: Major - P3
    • Affects Version/s: None
    • Component/s: None
    • Cluster Scalability

      These metrics will allow us to build the dashboard in SERVER-130028, which will provide fleet-wide monitoring of whether validations are working as intended and what impact they have.

      Metric | Why it helps
      When resharding ran with validations enabled | Lets the dashboard compare stats between clusters that ran resharding with validations and clusters that ran it without them.
      Validation success, failure, skipped, or retried | Can be aggregated into a dashboard to show the overall validation success/failure rate.
      Duration of donor clone count and final collection count verification | Shows whether verifications take long enough to time out and be skipped.
      Size-bucketed validation success and failure metrics for collections <= 100 GB, <= 1 TB, and > 1 TB | Can be aggregated into a dashboard to show the breakdown of validation success/failure by collection size.
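
      The metrics listed above could be modeled roughly as follows. This is only an illustrative sketch in Python, not the server's implementation: the class `ReshardingValidationMetrics`, the function `size_bucket`, the bucket and outcome labels, and the choice of binary units for "GB" and "TB" are all assumptions made for the example.

```python
from collections import Counter

GIB = 1024 ** 3  # assumption: binary units; the ticket only says "GB" and "TB"
TIB = 1024 ** 4

OUTCOMES = ("success", "failure", "skipped", "retried")
VERIFICATIONS = ("donor_clone_count", "final_collection_count")


def size_bucket(collection_bytes: int) -> str:
    """Map a collection size to one of the three buckets named in the ticket."""
    if collection_bytes <= 100 * GIB:
        return "le_100GB"
    if collection_bytes <= TIB:
        return "le_1TB"
    return "gt_1TB"


class ReshardingValidationMetrics:
    """Hypothetical in-memory counters; not the server's actual metrics API."""

    def __init__(self) -> None:
        self.runs_by_validation_flag = Counter()  # "enabled" / "disabled" -> runs
        self.outcomes = Counter()                 # outcome -> count
        self.outcomes_by_size = Counter()         # (bucket, outcome) -> count
        self.duration_ms_total = Counter()        # verification -> summed millis

    def record_run(self, validations_enabled: bool) -> None:
        key = "enabled" if validations_enabled else "disabled"
        self.runs_by_validation_flag[key] += 1

    def record_outcome(self, outcome: str, collection_bytes: int) -> None:
        if outcome not in OUTCOMES:
            raise ValueError(f"unknown outcome: {outcome}")
        self.outcomes[outcome] += 1
        # The ticket only asks for size-bucketed success and failure counts.
        if outcome in ("success", "failure"):
            self.outcomes_by_size[(size_bucket(collection_bytes), outcome)] += 1

    def record_duration(self, verification: str, millis: int) -> None:
        if verification not in VERIFICATIONS:
            raise ValueError(f"unknown verification: {verification}")
        self.duration_ms_total[verification] += millis
```

      A dashboard query would then aggregate these counters across clusters, for example by dividing failures by the sum of successes and failures within each size bucket.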

       

            Assignee:
            Unassigned
            Reporter:
            Wenqin Ye
            Votes:
            0
            Watchers:
            1

              Created:
              Updated:
              Resolved: