Umbrella ticket for Faulty Mongos project v5.0 backport

XMLWordPrintableJSON

    • Type: Improvement
    • Resolution: Fixed
    • Priority: Major - P3
    • 5.3.0
    • Affects Version/s: None
    • Component/s: None
    • Fully Compatible
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      1. Manually examine all test logs at Waterfall, especially LDAP
      2. Manage all backport tickets

      Ticket List for the mainline:

      SERVER-58152 Create Feature flag for Remove Faulty Mongos From Cluste…
      SERVER-59356 Initial scaffolding of the FaultManager
      SERVER-59357 Initial scaffolding of the Fault class
      SERVER-59358 FaultFacet class initial scaffolding and unit test
      SERVER-59362 Setup Fault Manager State Machine
      SERVER-59522 HealthCheckStatus should track fault status and lifetime
      SERVER-59360 Health observer registration and basic mock class
      SERVER-59496 Fault class is made to be a container of fault facets
      SERVER-59567 Make the HealthManager to instantiate HealthObservers
      SERVER-59567 Health observers are invoked periodically and create a F…
      SERVER-59608 remove const from return type
      SERVER-59367 state machine transition when entering the transient fau…
      SERVER-59912 changes in the base health package for the Ldap health o…
      SERVER-59361: Implement periodic health check thread pool
      SERVER-60316 FaultManager should start with periodic checks disabled
      SERVER-59370: unify transitionState test code path and production code
      SERVER-60079 Make health checkers asynchronous, block check until pre…
      SERVER-59364 Should move to the OK state after performing a successfu…
      SERVER-60587 Implement FaultFacet and make necessary changes in Healt…
      SERVER-59370: Should Transition to ActiveFault state when in the Tran…
      SERVER-59396 Adds server parameter healthMonitoring
      SERVER-59366 Progress monitor for periodic health check
      SERVER-61071 Removes all instances of HealthObserverIntensity
      SERVER-61073 fix getParameter on healthMonitoring
      SERVER-61368 SERVER-61315 Ldap health check executor should support a…
      SERVER-59373 adds new server status section
      SERVER-61368 fix link error on RHEL 8.0 Shared Library (No SSL)
      SERVER-59365 new state machine implementation for FaultManager
      SERVER-59365 fix ASAN link error
      SERVER-61438 fix race in FaultManagerTest
      SERVER-61872 Fixed thread pool starvation in FaultManager
      SERVER-61871 use tassert for state machine programmer errors
      SERVER-61921 fix link error in noSSL mode
      SERVER-59365: Use the new state machine.
      SERVER-59397 Add jitter when scheduling next health check
      SERVER-61956 fix data race when accessing the state machine's state
      SERVER-61914: add fault facet details to FaultImpl::toBSON
      SERVER-59382: Enforce non-critical facets not entering ActiveFault state
      SERVER-61873 add configurable health observer parameters
      SERVER-61220 Integration test for progress monitor
      SERVER-62096 test should not rely on /smaps, reduce verbosity
      SERVER-62084 unify FaultFacetType serialization implementations
      SERVER-62098: Guard access to healthCheckContexts with a mutex
      SERVER-61930: Individual health observers should return an error if a
      SERVER-59368 runtime change of intensities values
      SERVER-60944 Simplify Fault class hierarchy and interface for updatin…
      SERVER-62188 fix memory corruption in the DeadlineFuture
      SERVER-62197: Get rid of potential deadlock.
      SERVER-62203: rename thread name
      SERVER-62202 add observer type as string to log 5936504
      SERVER-62204 do not schedule health check if observer is not enabled
      SERVER-62174 Refactored health check intervals
      SERVER-58153 Enable Feature flag for Remove Faulty Mongos From Cluste…
      SERVER-60846 replace double severity with enum type
      SERVER-62357 Increase the default health check progress monitor interval
      SERVER-62378 Remove improperly merged lines from unit test
      SERVER-62404: Simplify mutex locking in fault_manager.cpp
      SERVER-62321: Increase kActiveFaultDuration for OneFacetIsResolved test.
      SERVER-59375 SERVER-62373 additional serverStatus sections for health…
      SERVER-62465: After intensities are updated, the resulting health che…
      SERVER-63110 manually fixed incompatibilities caused by backport from…
      SERVER-59391 fault if LDAP facets are enabled but misconfigured
      SERVER-62312 health monitoring documentation
      SERVER-62904: Fault Manager progress checker should not fault unless …

      Ticket list for the Enterprise module:

      SERVER-59912 Initial scaffolding and self registration of the Ldap health checker
      SERVER-60084 Fix clang related compile failure in Enterprise Ldap
      SERVER-60079 Make health checkers asynchronous, block check until previous is done
      SERVER-59366 Progress monitor for periodic health check
      SERVER-59386 Ldap health checker
      SERVER-61368 SERVER-61315 Ldap health check executor should support aborted tasks; test refactorings
      SERVER-61220 Ldap health checker linked into mongos
      SERVER-60846 replace double severity with enum type
      SERVER-59391 fault if LDAP facets are enabled but misconfigured
      SERVER-63110 reconsiled differences from head during 5.0 backport

            Assignee:
            Andrew Shuvalov (Inactive)
            Reporter:
            Andrew Shuvalov (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: