[SERVER-28227] [ApplyBatchFinalizerForJournal] Got signal: 6 (Aborted) Created: 07/Mar/17 Updated: 31/May/17 Resolved: 21/Mar/17
|
| Status: | Closed |
| Project: | Core Server |
| Component/s: | Storage |
| Affects Version/s: | 3.4.2 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major - P3 |
| Reporter: | Robert Romano | Assignee: | Mark Agarunov |
| Resolution: | Done | Votes: | 0 |
| Labels: | None |
| Remaining Estimate: | Not Specified |
| Time Spent: | Not Specified |
| Original Estimate: | Not Specified |
| Environment: | SUSE Enterprise Linux 12 SP2 |
| Operating System: | ALL |
| Participants: |
| Description |
|
| Comments |
| Comment by Mark Agarunov [ 21/Mar/17 ] |
|
Hello rromano, We haven’t heard back from you for some time, so I’m going to mark this ticket as resolved. If this is still an issue for you, please provide additional information and we will reopen the ticket. Thanks,
| Comment by Mark Agarunov [ 10/Mar/17 ] |
|
Hello rromano, Thank you for the report. To better investigate the behavior you've described, I'd like to request some additional information. Could you please provide the following:
I've created an upload portal so that you can securely send us these files. Additionally, it appears that the three logs provided are timestamped a few days apart. Are the timestamps on the logs accurate? If so, did this issue occur multiple times, or was there a delay between the primary and the secondary crashing? Thanks,
| Comment by Robert Romano [ 07/Mar/17 ] |
|
I just noticed "No space left on device" in the stack trace, so this may be a user error. Still, it would be nice if another replica could have remained up as primary.
| Comment by Robert Romano [ 07/Mar/17 ] |
|
When the primary crashed, one of the secondaries also crashed with this log entry:
The last replica kept running and stayed in SECONDARY state, with these log entries repeating without end:
The log entry "Not starting an election, since we are not electable due to: Not standing for election because I cannot see a majority (mask 0x1)" is alarming. The last standing replica never took over the primary role, so the cluster remained down. Very sad!
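
For context on the "cannot see a majority" log entry: a replica set member can only be elected primary if it can reach a strict majority of the voting members. Below is a minimal sketch of that arithmetic, assuming a three-member replica set in which every member votes. The function names are hypothetical and are not MongoDB APIs.

```python
def majority_needed(voting_members: int) -> int:
    """Smallest number of votes that is a strict majority of the set."""
    return voting_members // 2 + 1


def can_elect_primary(voting_members: int, reachable_members: int) -> bool:
    """True if the reachable members (including the candidate itself)
    are enough to form a majority. Assumes every member has one vote."""
    return reachable_members >= majority_needed(voting_members)


if __name__ == "__main__":
    # Assumed topology: 3 voting members, as implied by the three logs.
    print(majority_needed(3))        # 2
    print(can_elect_primary(3, 1))   # False: only the survivor is up
    print(can_elect_primary(3, 2))   # True: one member down is tolerated
```

Under that assumption, with two of three members down the survivor holds one vote out of the two required, so it stays in SECONDARY state and does not stand for election.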