[SERVER-25818] Network timeout on Shard lead to 2 primarys Created: 26/Aug/16 Updated: 26/Aug/16 Resolved: 26/Aug/16 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Replication |
| Affects Version/s: | 3.2.8 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Stefan Stark | Assignee: | Kelsey Schubert |
| Resolution: | Done | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Attachments: |
|
| Operating System: | ALL |
| Participants: |
| Description |
|
A network error lead to continuous elections for a few hours until the Shard finally broke. The main problem can be seen in the following log: |
| Comments |
| Comment by Kelsey Schubert [ 26/Aug/16 ] | ||||||||||
|
Thank you for reporting this issue. After examining the logs, the behavior that you observe is expected given the network conditions. I see that Cluster01_Shard0_Primary repeatedly loses its connection to the rest of the replica set. As a result, it must step down and then call for a priority takeover when it reconnects. To resolve this issue, I would recommend investigating the cause of the network errors. In the interim, you can set the two nodes to same priority, which will prevent elections from continually occurring. For MongoDB-related support discussion please post on the mongodb-users group or Stack Overflow with the mongodb tag. A question like this involving more discussion would be best posted on the mongodb-users group. Kind regards, | ||||||||||
| Comment by Ramon Fernandez Marina [ 26/Aug/16 ] | ||||||||||
|
Sorry you've run into this issue stefan.stark@qplix.com, and thanks for uploading the logs – we'll investigate. | ||||||||||
| Comment by Stefan Stark [ 26/Aug/16 ] | ||||||||||
|
Accidently saved ticket to early. More info: Primary and Secondary mongod clients lose sight of each other, both get elected to be primary. Once the network is restored the keep stepping down:
OS: Windows Server 2012 |