Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.0.12
Component/s: None
Labels:
- query-44-grooming

Assigned Teams:

Query Execution
Operating System:
ALL
Sprint:
QE 2022-10-17, QE 2022-10-31, QE 2022-11-14, QE 2022-11-28, QE 2022-12-12, QE 2022-12-26, QE 2023-01-09
Case:
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

Each of the stages listed in the title keeps a set of RecordIds; these are used to identify seen documents in order to ensure that we do not return the same document twice to the user. However, this requires memory proportional to the number of documents processed, and nothing is in place to ensure that we do not consume too much. One example of how to reproduce this unbounded memory growth is given below.

40 M documents of the form {x:0, y:0}
index on {y:1}
query of the following form (full repro script attached)

        q = {$or: [{x: 0, y: 0}, {x: 0, y: 0}]}
        db.c.find(q).hint({y: 1}).sort({z: -1}).limit(30).itcount()

Heap profile call tree shows memory usage by OrStage::work grow to about 1.5 GB as it scans the collection, then drop back to 0 at conclusion of query. Graph in each row shows memory usage for that node and its descendants; second number in each row is max memory in MB for that node.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

heap-profile.png
229 kB
Jun 02 2016 06:26:57 PM UTC
memory-growth-calltree.png
170 kB
Jun 08 2016 11:35:27 AM UTC
repro.sh
1 kB
Jun 02 2016 06:26:57 PM UTC

depends on

SERVER-97745 OrStage should spill if allowDiskUse is specified and memory usage has exceeded some limit

Blocked

SERVER-97746 MergeSortStage should spill if allowDiskUse is specified and memory usage has exceeded some limit

Blocked

SERVER-97747 IndexScan should spill if allowDisk is specified and memory usage has exceeded some limit

Blocked

is duplicated by

SERVER-36087 Executing $text statements in conjunction with 'count' or 'sort' provokes Out Of Memory

Closed

SERVER-123 multi-key _id deduping uses a lot of memory

Closed

is related to

SERVER-26534 Text search uses excessive memory

Backlog

related to

SERVER-82167 $regex may use unbounded cpu and memory

Open

SERVER-87134 Consider using roaring bitmaps for RecordId deduplication in IndexScan stages

Closed

SERVER-20239 Built-in sampling heap profiler

Closed

SERVER-36794 Non-blocking $text plans with just one search term do not need an OR stage

Closed

(1 is related to, 4 related to)

Assignee:: [DO NOT USE] Backlog - Query Execution
Reporter:: Bruce Lucas (Inactive)
Participants:: [DO NOT USE] Backlog - Query Execution, Bruce Lucas, David Storch, Kyle Suarez, Nimesh Shah
Votes:: 5 Vote for this issue
Watchers:: 50 Start watching this issue

Created:: Jun 02 2016 06:26:57 PM UTC
Updated:: Feb 24 2025 10:29:47 AM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates