[SERVER-25277] FTDC subsystem shuts down when there are too many open files Created: 25/Jul/16  Updated: 10/Aug/16  Resolved: 08/Aug/16

Status: Closed
Project: Core Server
Component/s: Diagnostics
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Kamran K. Assignee: DO NOT USE - Backlog - Platform Team
Resolution: Done Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Related
Operating System: ALL
Participants:
Linked BF Score: 0

 Description   

I spotted this in a test log:

W FTDC     [ftdc] Uncaught exception in 'Location13538: couldn't open [/proc/3115/stat] Too many open files' in full-time diagnostic data capture subsystem. Shutting down the full-time diagnostic data capture subsystem.

I would expect FTDC to operate in a degraded mode instead of shutting down completely.



 Comments   
Comment by Justin Cohler [ 08/Aug/16 ]

See mark.benvenuto's comments above.

Comment by Mark Benvenuto [ 25/Jul/16 ]

FTDC does operate in a degraded mode when certain errors are hit, reports them, and continues the next iteration. When exceptions are thrown, it does not try to evaluate if it is safe to continue, and simply shuts the loop down. This exception comes from serverStatus.

Generated at Thu Feb 08 04:08:45 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.