-
Type:
Task
-
Resolution: Unresolved
-
Priority:
Major - P3
-
None
-
Affects Version/s: None
-
Component/s: None
-
None
-
Workload Resilience
-
None
-
None
-
None
-
None
-
None
-
None
-
None
To better observe how the rate limiter is affecting individual clusters / nodes, we need a dashboard showing establishment metrics (current rates, queues, rejections, etc) for each node in a cluster, alongside various generic health metrics for those nodes.
This will be used both to determine effectiveness of the static limits (e.g. if it is too aggressive and it’s limiting establishments when the system has many available resources, or if it's too conservative and it kicks in later than it should have) and for incident response.
- related to
-
SERVER-120198 Identify required metrics for connection establishment rate limiting dashboards
-
- Open
-
-
SERVER-120192 Create fleet-wide monitoring dashboard for connection establishments
-
- Needs Scheduling
-