Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Won't Fix
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

This new behavior would help in the HELP ticket incident. While we have the Enterprise Watchdog monitoring the storage health the Community edition mongod primary can be stuck on a faulty drive for hours without stepping down. The Watchdog targets this problem fast, but there is no good story for community edition at all.

While the Enterprise Watchdog will continue providing premium services, the Enterprise edition will have a more generic slower solution, however still preventing a multi-hour outage. The reaction time will be different by design, maintaining the service differentiation: Watchdog is capable to detect such outage as fast as 10-30 seconds (based on configuration) while the thread liveness monitor will achieve identical result after 5-10 minutes of outage.

Assigning to shameek.ray to make this blocked on the PM ticket he is creating.

Assignee:: Shameek Ray
Reporter:: Andrew Shuvalov (Inactive)
Participants:: Andrew Shuvalov, Shameek Ray
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Apr 07 2021 02:41:57 PM UTC
Updated:: May 11 2021 01:52:27 PM UTC
Resolved:: May 11 2021 01:52:26 PM UTC

Details

Description

Attachments

Activity

People

Dates