Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.0.12
Component/s: None
Labels:
- query-44-grooming

Assigned Teams:

Query Execution
Operating System:
ALL
Sprint:
QE 2022-10-17, QE 2022-10-31, QE 2022-11-14, QE 2022-11-28, QE 2022-12-12, QE 2022-12-26, QE 2023-01-09
Case:
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Each of the stages listed in the title keeps a set of RecordIds; these are used to identify seen documents in order to ensure that we do not return the same document twice to the user. However, this requires memory proportional to the number of documents processed, and nothing is in place to ensure that we do not consume too much. One example of how to reproduce this unbounded memory growth is given below.

40 M documents of the form {x:0, y:0}
index on {y:1}
query of the following form (full repro script attached)

        q = {$or: [{x: 0, y: 0}, {x: 0, y: 0}]}
        db.c.find(q).hint({y: 1}).sort({z: -1}).limit(30).itcount()

Heap profile call tree shows memory usage by OrStage::work grow to about 1.5 GB as it scans the collection, then drop back to 0 at conclusion of query. Graph in each row shows memory usage for that node and its descendants; second number in each row is max memory in MB for that node.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

repro.sh
1 kB
Jun 02 2016 06:26:57 PM UTC
memory-growth-calltree.png
170 kB
Jun 08 2016 11:35:27 AM UTC
heap-profile.png
229 kB
Jun 02 2016 06:26:57 PM UTC

is duplicated by

SERVER-36087 Executing $text statements in conjunction with 'count' or 'sort' provokes Out Of Memory

Closed

SERVER-123 multi-key _id deduping uses a lot of memory

Closed

is related to

SERVER-26534 Text search uses excessive memory

Closed

SERVER-111142 Add serverStatus metric that tracks memory used for RID deduping across all queries

Closed

related to

SERVER-82167 $regex may use unbounded cpu and memory

Open

SERVER-87134 Consider using roaring bitmaps for RecordId deduplication in IndexScan stages

Closed

SERVER-20239 Built-in sampling heap profiler

Closed

SERVER-36794 Non-blocking $text plans with just one search term do not need an OR stage

Closed

SERVER-111142 Add serverStatus metric that tracks memory used for RID deduping across all queries

Closed

split to

SERVER-97745 [CLASSIC] OrStage should spill if allowDiskUse is specified and memory usage has exceeded some limit

Backlog

SERVER-97746 [CLASSIC] MergeSortStage should spill if allowDiskUse is specified and memory usage has exceeded some limit

Backlog

SERVER-97747 [CLASSIC] IndexScan should spill if allowDisk is specified and memory usage has exceeded some limit

Backlog

(4 related to, 3 split to)

Assignee:: Unassigned
Reporter:: Bruce Lucas (Inactive)
Participants:: Bruce Lucas, David Storch, Kyle Suarez, Nimesh Shah
Votes:: 5 Vote for this issue
Watchers:: 57 Start watching this issue

Created:: Jun 02 2016 06:26:57 PM UTC
Updated:: May 06 2026 06:23:23 PM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates