Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Diagnostics, Replication
Labels:
- elections
Environment:
64-bit Linux, server 2.4.x, replica set

Assigned Teams:

Replication
Case:
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When a replica has disk space issues it does not cause it to shut down, nor in the case of a primary, to step down. Instead the server periodically tests to see if enough space has been freed up to continue.

As a result, the only indication of a problem is additional writes cause user asserts on the primary, and there's likely some introduction of repl lag on secondaries. But, from the perspective of rs.status() and db.serverStatus() everything looks fine (except for any introduced asserts/lag).

Some options mentioned in discussion with server team:

Have replset member step down (if primary)
Have replset member enter maintenance status (until disk space is avail)
Add warning message to [startup]warning log

Bonus: would be great if there was an explicit state/status change that could be picked up and reported by MMS. The last option should work for that.

is duplicated by

SERVER-10634 Failover doesn't occur on disk full and other non-crash errors

Closed

is related to

SERVER-14139 Disk failure on one node can (eventually) block a whole cluster

Closed

related to

SERVER-3759 filesystem ops may cause termination when no space left on device

Closed

SERVER-22971 Operations on some sharded collections fail with bogus error

Closed

SERVER-17230 Replica set Primary should step down if Out of file descriptors

In Progress

Assignee:: [DO NOT USE] Backlog - Replication Team
Reporter:: John Morales (Inactive)
Participants:: [DO NOT USE] Backlog - Replication Team, Anne Moroney, Daniel Watrous, Henrik Ingo, Ian Bentley, John Morales
Votes:: 10 Vote for this issue
Watchers:: 18 Start watching this issue

Created:: May 03 2013 03:14:01 PM UTC
Updated:: Dec 06 2022 05:21:30 AM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates