[SERVER-21818] Capture system metrics in FTDC Created: 09/Dec/15 Updated: 25/Apr/17 Resolved: 26/Jul/16 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Diagnostics |
| Affects Version/s: | None |
| Fix Version/s: | 3.2.13, 3.3.11 |
| Type: | Improvement | Priority: | Major - P3 |
| Reporter: | Bruce Lucas (Inactive) | Assignee: | Mark Benvenuto |
| Resolution: | Done | Votes: | 3 |
| Labels: | monitoring | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Attachments: |
|
||||||||||||||||||||||||||||||||||||||||||||
| Issue Links: |
|
||||||||||||||||||||||||||||||||||||||||||||
| Backwards Compatibility: | Fully Compatible | ||||||||||||||||||||||||||||||||||||||||||||
| Sprint: | Platforms 15 (06/03/16), Platforms 18 (08/05/16) | ||||||||||||||||||||||||||||||||||||||||||||
| Participants: | |||||||||||||||||||||||||||||||||||||||||||||
| Description |
|
Currently full-time data capture only includes internal metrics (with a small number of exceptions). It would be useful to also capture system metrics related to cpu, memory, and storage. For illustrative purposes attached is a POC data capture tool sysmon.py that captures such information on Linux from /proc/stat, /proc/meminfo, and /sys/block/*/stat that has proven useful for problem diagnosis. Captured information includes the following: /proc/stat /proc/meminfo /sys/block/*/stat Similar metrics are available through Windows APIs. Where applicable cumulative counters are preferred over instantaneous values because cumulative counters can be sampled at arbitrary time intervals. In general raw system-specific metrics with a minimum of processing are preferred, leaving it to tooling to subsample as needed and compute useful values for display. (An exception might be for example that sectors could be converted to bytes because sector may be a system- or device-specific unit.) |
| Comments |
| Comment by Mark Benvenuto [ 26/Jul/16 ] |
|
Linux and Windows implementations have been completed. |
| Comment by Andrew Morrow (Inactive) [ 26/Jul/16 ] |
|
Mark please resolve this ticket, we held it open as a tracking ticket but all of the associated work has been completed. |
| Comment by Mark Benvenuto [ 24/May/16 ] |
|
lucas.hrabovsky Thanks for the reference. sigar supports a lot of operating systems, and a lot of very old versions (like NT4). I will use as a reference if I need to find the various APIs to gather data. |