[SERVER-17845] Running the dropDatabase command can cause an election to occur Created: 01/Apr/15 Updated: 03/Apr/15 Resolved: 03/Apr/15 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Replication |
| Affects Version/s: | 3.0.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Robert Stam | Assignee: | Amalia Hawkins |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Attachments: |
|
||||||||
| Issue Links: |
|
||||||||
| Backwards Compatibility: | Fully Compatible | ||||||||
| Operating System: | Windows | ||||||||
| Sprint: | Security 1 04/03/15 | ||||||||
| Participants: | |||||||||
| Description |
|
We are encountering unexpected elections when running the .NET driver unit tests against a replica set (with auth and ssl if that makes a difference). Apparently running the dropDatabase command can cause an unexpected election to occur. From the attached logs: On member 27017:
And on member 27018:
So 27018 has taken the role of primary away from 27017. Here's what seems to be bogging member 27017 down so much that 27018 concluded it was down:
|
| Comments |
| Comment by Matt Dannenberg [ 02/Apr/15 ] |
|
Cherry-picking the commit from |
| Comment by Bernie Hackett [ 02/Apr/15 ] |
|
OIC, that's definitely not the problem then. |
| Comment by Jeffrey Yemin [ 01/Apr/15 ] |
|
This is happening on the dedicated Windows machines. I don't think there are disk space issues there. |
| Comment by Bernie Hackett [ 01/Apr/15 ] |
|
Jenkins has been having issues with running out of disk space recently. I wonder if this is related? |
| Comment by Robert Stam [ 01/Apr/15 ] |
|
I don't know. We run our tests on Jenkins and Jenkins is configured to run the entire replica set (and the unit tests) on a single machine. |
| Comment by Eric Milkie [ 01/Apr/15 ] |
|
Does the same problem happen if you run the test on separate physical machines for each node? I am curious if a lack of system resources is part of the issue. |