Decrease load on locust primary for resharding failover workload

XMLWordPrintableJSON

    • Type: Task
    • Resolution: Fixed
    • Priority: Major - P3
    • 8.3.0-rc0
    • Affects Version/s: None
    • Component/s: None
    • None
    • Cluster Scalability
    • Fully Compatible
    • 0
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      As seen in BF-39078, the aggressive failover testing is overloading the locust primary's cpu causing the the workload to timeout while waiting for the resharding operation to finish. The following improvements should be considered to decrease the load:

      • decrease number of read and write users
      • remove priority on the config server to avoid extra failovers
      • reduce cadence of stepdowns on config server

       

            Assignee:
            Kruti Shah
            Reporter:
            Kruti Shah
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

              Created:
              Updated:
              Resolved: