[SERVER-14151] Election time increases in case of frequent stepdown Created: 03/Jun/14 Updated: 07/Oct/15 Resolved: 21/Sep/15 |
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Replication |
| Affects Version/s: | 2.7.0 |
| Fix Version/s: | 3.1.9 |
| Type: | Bug | Priority: | Critical - P2 |
| Reporter: | Davide Italiano | Assignee: | Eric Milkie |
| Resolution: | Done | Votes: | 0 |
| Labels: | 28qa | ||
| Remaining Estimate: | Not Specified | ||
| Time Spent: | Not Specified | ||
| Original Estimate: | Not Specified | ||
| Environment: |
Linux localhost.localdomain 3.14.4-200.fc20.x86_64 #1 SMP Tue May 13 13:51:08 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux |
||
| Issue Links: |
|
||||
| Backwards Compatibility: | Fully Compatible | ||||
| Operating System: | ALL | ||||
| Participants: | |||||
| Description |
|
To reproduce, get Analysis: Whenever the "health thread" gets new results, msgCheckNewState() is called. The code in _yea() looks like this:
Under some circumstances, if the stepDown period is too low, the The exception will then be propagated to the caller and it'll result
causing a delay in the election. Changing LeaseTime to a smaller value makes the problem disappear, but exposes some more subtle issues such as the one reported in |
| Comments |
| Comment by Eric Milkie [ 21/Sep/15 ] |
|
With the new election protocol enhancements, this problem is no longer an issue. |
| Comment by Eric Milkie [ 17/Jul/14 ] |
|
We'll take a look at this after the replication refactoring is complete. |