Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Won't Do
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Sharding
Labels:
None

Assigned Teams:

Sharding EMEA
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When a chunk move fails, mongo leaves behind files named "preCleanup.$timestamp.bson" inside the $DBPATH/moveChunk. Often times, the reason why the chunk move fails is not a transient condition, causing the move to fail again if attempted, until the root cause is fixed. When the balancer is enabled, it will choose to move the same chunk to the same destination over and over, failing each time, causing these preCleanup files to be placed on disk and never getting reaped. Over a short period of time (say, a day), this can easily use up all of the available inodes on that filesystem.

We had this happen over the weekend, and once all the inodes are used, the mongoD will exit and will fail to restart until there are available inodes again. This seems like non ideal behavior, and I think it would be much better if the preCleanup files would also get cleaned up after a failed chunk move instead of allowing them to accumulate on disk.

Assignee:: [DO NOT USE] Backlog - Sharding EMEA
Reporter:: Dai Shi
Participants:: [DO NOT USE] Backlog - Sharding EMEA, Dai Shi, Kaloian Manassiev, Kelsey Schubert
Votes:: 0 Vote for this issue
Watchers:: 9 Start watching this issue

Created:: Sep 07 2016 12:28:45 AM UTC
Updated:: Dec 06 2022 04:17:23 AM UTC
Resolved:: Dec 17 2021 02:43:39 PM UTC

Details

Description

Attachments

Activity

People

Dates