[SERVER-32828] Connection storm on Primary, no election Created: 22/Jan/18  Updated: 06/Dec/22  Resolved: 23/Jan/18

Status: Closed
Project: Core Server
Component/s: Replication
Affects Version/s: 3.2.12
Fix Version/s: None

Type: Bug Priority: Major - P3
Reporter: Clare Scally Assignee: Backlog - Replication Team
Resolution: Duplicate Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Depends
Duplicate
duplicates SERVER-32270 Use shared NetworkInterfaceASIO in mo... Closed
Related
related to SERVER-32867 Tie liveness to the ability to replicate Backlog
Assigned Teams:
Replication
Operating System: ALL
Participants:
Case:

 Description   

In the scenario where a PRIMARY node is stormed with inbound connections and becomes inaccessible to inbound connections, is it possible that replication heartbeats continue and that no election is triggered?



 Comments   
Comment by Spencer Brody (Inactive) [ 23/Jan/18 ]

SERVER-32270 would have avoided this issue, since heartbeats and data replication would have been sharing the same connection pool and thus if heartbeats could get a connection, data replication would also be able to get a connection.

There's a broader conversation to be had here about what it means for a node to report itself as "alive". A node that can heartbeat but cannot replicate data, for any reason, should probably not be considered "alive". Figuring out a story for tying liveness monitoring in with the ability to replicate data would be very valuable, but is complex and would be a significant change to the current architecture. It's something I'd love to do, but I don't expect us to get to it in our current 3.7 development cycle.

For now, I'd like to close this ticket as a duplicate of SERVER-32270, which would address the specific networking issues brought up in this ticket, and I will file a separate ticket for a more comprehensive story around liveness monitoring.

Generated at Thu Feb 08 04:31:24 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.