Uploaded image for project: 'Core Server'
  1. Core Server
  2. SERVER-96813

Utilize jitter when trying to rediscover a host after a failed monitoring requests

    • Type: Icon: Improvement Improvement
    • Resolution: Unresolved
    • Priority: Icon: Major - P3 Major - P3
    • None
    • Affects Version/s: None
    • Component/s: None
    • None
    • Networking & Observability

      After a large network disruption, the RSM will attempt to rediscover hosts whose monitoring connections were severed. In large clusters, this could result in a burst of monitoring connection establishments, which may result in network congestion, DNS server overload, or contention on the RSM's reactor thread. If a randomized delay were used when scheduling the first monitoring request after a previously monitored server became marked as Unknown, it could help to mitigate these issues.

            Assignee:
            Unassigned Unassigned
            Reporter:
            patrick.freed@mongodb.com Patrick Freed
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated: