Core Server / SERVER-22906

MongoD uses excessive memory over and above the WiredTiger cache size



    • Type: Bug
    • Status: Closed
    • Priority: Critical - P2
    • Resolution: Duplicate
    • Affects Version/s: 3.2.1
    • Fix Version/s: None
    • Component/s: WiredTiger
    • Sprint: Integration 11 (03/14/16)


      Issue Status as of Sep 30, 2016

      MongoDB with WiredTiger may experience excessive memory fragmentation. This was mainly caused by the difference in the way dirty and clean data are represented in WiredTiger. Dirty data involves smaller allocations (at the size of individual documents and index entries), which are rewritten in the background into page images (typically 16-32KB). In 3.2.10 and above (and 3.3.11 and above), the WiredTiger storage engine only allows 20% of the cache to become dirty. Eviction works in the background to write dirty data and keep the cache from being filled with small allocations.

      The changes in WT-2665 and WT-2764 limit the overhead from tcmalloc caching and fragmentation to 20% of the cache size (from fragmentation) plus 1GB of cached free memory with default settings.
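      Under those limits, the worst-case resident size can be estimated with simple arithmetic. A sketch, where the 50 GB cache size is a hypothetical example value, not a figure from this ticket:

```shell
# Worst-case mongod memory estimate under the post-fix behavior (3.2.10+):
# configured cache size, plus up to 20% of the cache lost to fragmentation,
# plus roughly 1GB of free memory cached by tcmalloc at default settings.
CACHE_GB=50                                # hypothetical configured cache size
FRAG_GB=$(( CACHE_GB * 20 / 100 ))         # up to 20% fragmentation overhead
TCMALLOC_FREE_GB=1                         # tcmalloc cached free memory (default)
TOTAL_GB=$(( CACHE_GB + FRAG_GB + TCMALLOC_FREE_GB ))
echo "worst-case total: ${TOTAL_GB} GB"    # prints "worst-case total: 61 GB"
```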

      User impact: Memory fragmentation caused MongoDB to use more memory than expected, leading to swapping and/or out-of-memory errors.

      Workaround: Configure a smaller WiredTiger cache than the default.
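      For example, the cache can be capped in the mongod configuration file; a sketch, where the 4GB value is an illustrative choice, not a recommendation from this ticket:

```yaml
# mongod.conf fragment: cap the WiredTiger cache below the default
storage:
  wiredTiger:
    engineConfig:
      cacheSizeGB: 4
```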

      Affected versions: MongoDB 3.0.0 to 3.2.9 with WiredTiger.

      Fix version: The fix is included in the 3.2.10 production release.

      Numerous reports of mongod using excessive memory. This has been traced back to a combination of factors:

      1. TCMalloc does not free from page heap
      2. Fragmentation of spans due to varying allocation sizes
      3. The current cache size limit is enforced on net memory use, excluding allocator overhead, so the configured limit is often significantly less than total memory used. This is surprising and difficult for users to tune for appropriately

      Issue #1 has a workaround: setting an environment variable (AGGRESSIVE_DECOMMIT), though it may have a performance impact. Further investigation is ongoing.
      Issue #2 has fixes in place in v3.3.5.
      Issue #3 will likely be addressed by making the WiredTiger engine aware of memory allocation overhead, and tuning cache usage accordingly. (Need reference to WT ticket)
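      The environment-variable workaround for issue #1 can be sketched as a launch script. TCMALLOC_AGGRESSIVE_DECOMMIT is the gperftools spelling of the variable referred to above, and the config file path is an example, not from this ticket; note the possible performance impact:

```shell
# Sketch of the issue #1 workaround: enable tcmalloc aggressive decommit
# before starting mongod, so freed spans are returned to the OS eagerly.
# Enabling this may reduce throughput.
export TCMALLOC_AGGRESSIVE_DECOMMIT=t
echo "aggressive decommit: ${TCMALLOC_AGGRESSIVE_DECOMMIT}"
# Then start mongod as usual, e.g. (path is an example):
# mongod --config /etc/mongod.conf
```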

      Regression tests for memory usage are being tracked here: SERVER-23333

      Original Description
      While loading data into mongo, each of the 3 primaries crashed with memory allocation issues. As data keeps loading, new primaries are elected. Eventually it looks like they come down as well. Some nodes have recovered and come back up, but new ones keep coming down. Logs and diagnostic data are attached.


        1. diagnostic.data.tgz (19.80 MB, Matthew Clark)
        2. diagnostic.data-326.png (158 kB, Bruce Lucas)
        3. diagnostic.data-335.png (166 kB, Bruce Lucas)
        4. fragmentation.png (286 kB, Bruce Lucas)
        5. fragmentation-repro.png (79 kB, Bruce Lucas)
        6. fragmentation-repro-aggressive-decommit.png (143 kB, Bruce Lucas)
        7. metrics.2016-02-29T09-36-53Z-00000.gz (9.93 MB, James Mangold)
        8. metrics.2016-02-29T21-22-32Z-00000.gz (8.61 MB, James Mangold)
        9. metrics.2016-03-01T06-54-27Z-00000.gz (4.17 MB, James Mangold)
        10. mongodb.log.2016-03-01T06-51-04.gz (95.01 MB, James Mangold)
        11. Screen Shot 2016-05-05 at 10.44.34 AM.png (123 kB, Matthew Clark)
        12. tcmalloc_aggressive_decommit.png (152 kB, Christian Bayer)
