Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels: None
Assigned Teams: Networking & Observability
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

After a large network disruption, the RSM attempts to rediscover hosts whose monitoring connections were severed. In large clusters, this can produce a burst of monitoring connection establishments, which may cause network congestion, DNS server overload, or contention on the RSM's reactor thread. Adding a randomized delay before the first monitoring request to a previously monitored server that has been marked Unknown could help mitigate these issues.
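
To illustrate the proposal, here is a minimal sketch of jittered scheduling. It is not the RSM's actual code: the names `first_check_delay`, `schedule_rediscovery`, and `max_jitter_s` are hypothetical, and the uniform distribution is an assumption (the ticket only says "randomized delay").

```python
import random
from typing import Dict, Iterable, Optional


def first_check_delay(max_jitter_s: float,
                      rng: Optional[random.Random] = None) -> float:
    """Return a uniformly random delay in [0, max_jitter_s] seconds.

    Hypothetical helper: the upper bound and the distribution are assumptions.
    """
    if max_jitter_s < 0:
        raise ValueError("max_jitter_s must be non-negative")
    source = rng if rng is not None else random
    return source.uniform(0.0, max_jitter_s)


def schedule_rediscovery(hosts: Iterable[str],
                         now: float,
                         max_jitter_s: float,
                         rng: Optional[random.Random] = None) -> Dict[str, float]:
    """Map each host marked Unknown to the time of its first monitoring request.

    Without jitter every host would be scheduled at `now`; with jitter the
    requests are spread across the window [now, now + max_jitter_s].
    """
    return {host: now + first_check_delay(max_jitter_s, rng) for host in hosts}
```

With this approach, connection establishments for N hosts are spread over the jitter window instead of all starting at the same instant. The cost is that rediscovery of any single host may be delayed by up to `max_jitter_s`.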

Assignee: Unassigned
Reporter: Patrick Freed
Participants: Patrick Freed
Votes: 0
Watchers: 3

Created: Nov 06 2024 10:43:15 PM UTC
Updated: Nov 12 2024 07:57:46 PM UTC
