Major - P3
(copied to CRM)
Hope you all are doing well, recently we have faced an issue on our production system wherein mongos stops communicating with cluster and version we are using is 4.2.8.
wherein we tried login through mongos it was getting stuck on
Connecting to 127.0.0.1:27017 and nothing it was moving forward. where we checked the logs on mongos and we found something network error
NETWORK [conn1524750] DBException handling request, closing client connection: ClientDisconnect: operation was interrupted.
we tried restarting mongos and redeploying mongos but still the issue remains
but it was able to communicate when we perform telnet and everything on all the nodes in shards and config was intact.
and we checked the logs on the config server which was giving ssl handshake received but server is started without ssl support mongodb
but we are not using certificate/ssl based authentication it is simple key-file based authentication.
Later on we step down the primary on the config server and everything starts as normal after electing the new primary.
We found the similar scenarios wherein stepdown/restart config server fix the issue, so do we have any other fix apart from upgrade, this seems to be serious issue kindly let us know.
MonngoDB version we are using 4.2.8