- Type: New Feature
- Resolution: Fixed
- Priority: Critical - P2
- Affects Version/s: None
- Component/s: Networking
- None
- Fully Compatible
- v4.2, v4.0
- Sharding 2021-05-17
Currently, a mongos node can send a hello request to a replica set member and then wait indefinitely without receiving a response. In this case, the operation does not return until the connection times out on the mongos side, which can take several minutes depending on TCP keepalive settings.
This ticket is to create an application-level timeout mechanism that allows the RSM (ReplicaSetMonitor) to keep making progress monitoring other nodes in the presence of TCP blackholes or similar network failures. The timeout should be on the order of seconds to ensure cluster availability.
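The idea can be illustrated with a minimal sketch. This is not the server's implementation (which is C++); it is a hypothetical Python model, assuming each hello round trip is wrapped in an application-level deadline so that one unresponsive member cannot stall monitoring of the others. The names `send_hello`, `check_host`, `monitor_once`, and `HELLO_TIMEOUT_SECS`, as well as the timeout value, are assumptions made for illustration only.

```python
import asyncio

# Hypothetical timeout; the ticket only says "on the order of seconds".
# A small value is used here so the demo finishes quickly.
HELLO_TIMEOUT_SECS = 0.2


async def send_hello(host, delay):
    """Stand-in for a real hello round trip (hypothetical helper).

    delay=None simulates a TCP blackhole: the reply never arrives.
    """
    if delay is None:
        await asyncio.Event().wait()  # never set, so this waits forever
    await asyncio.sleep(delay)
    return {"host": host, "ok": 1}


async def check_host(host, delay, timeout=HELLO_TIMEOUT_SECS):
    """Run one hello with an application-level deadline."""
    try:
        await asyncio.wait_for(send_hello(host, delay), timeout)
        return host, "ok"
    except asyncio.TimeoutError:
        # The stuck request is cancelled instead of waiting on TCP keepalive.
        return host, "timed out"


async def monitor_once(hosts, timeout=HELLO_TIMEOUT_SECS):
    """Check all hosts concurrently; a stuck host does not block the rest."""
    results = await asyncio.gather(
        *(check_host(host, delay, timeout) for host, delay in hosts.items())
    )
    return dict(results)


def run_demo():
    # node-b simulates a blackholed replica set member.
    hosts = {"node-a": 0.01, "node-b": None, "node-c": 0.02}
    return asyncio.run(monitor_once(hosts))
```

Calling `run_demo()` returns a status for every host after roughly one timeout interval, with the blackholed member reported as timed out while the healthy members are reported normally.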
- related to SERVER-56917 Stuck Hello request may lead to cluster outage (Closed)