Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-92997

Allow errors when checking ReadWriteDistributionMetrics if analyzeShardKey request slower than splitPointExpirationSecs

    • Type: Icon: Task Task
    • Resolution: Fixed
    • Priority: Icon: Major - P3 Major - P3
    • 8.1.0-rc0, 8.0.0-rc17
    • Affects Version/s: None
    • Component/s: None
    • None
    • Cluster Scalability
    • Fully Compatible
    • v8.0
    • Cluster Scalability 2024-07-22
    • 0

      In the analyze_shard_key.js concurrency test, the expiration time for split point docs is set to 30 seconds to avoid making recovery extremely slow in suites that do restarts. However, we sometimes see the agg request (running $_analyzeShardKeyReadWriteDistribution) take longer than the expiration secs on the split point documents - this means the read/write metrics can be off if some (or all) split point docs have already been deleted. If this happens, we can ignore the failed assertions - one way of doing this could be to track the duration of the analyzeShardKey command in the workload, and if it is longer than the splitPointExpirationSecs value, ignore the error thrown by assertReadWriteDistributionMetrics in isAcceptableAnalyzeShardKeyError.

            Assignee:
            kruti.shah@mongodb.com Kruti Shah
            Reporter:
            janna.golden@mongodb.com Janna Golden
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated:
              Resolved: