Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 4.2.4, 3.6.18, 4.3.4, 4.0.17
Affects Version/s: None
Component/s: Aggregation Framework
Labels:
- qexec-team

Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Backport Requested:

v4.2, v4.0, v3.6
Steps To Reproduce:
Hide

Start a mongod server with the WT cache size configured to 0.2 GB using --wiredTigerCacheSizeGB=0.2. Insert 100,000,000 identical documents, each approximately 300 bytes. Then run the following two queries, and monitor memory consumption:

// This simple collection scan query should warm the cache, and thus should end up resulting in ~0.2GB of memory used. db.coll.find().itcount(); // In contrast, this query also needs to scan the collection. But it ends up using ~1 GB of memory, indicating that the system is unnecessarily consuming lots of memory outside the WT cache. db.coll.aggregate([{$match: {nonExistent: {$exists: false}}}, {$group: {_id: null, count: {$sum: 1}}}]).toArray();
Show
Start a mongod server with the WT cache size configured to 0.2 GB using --wiredTigerCacheSizeGB=0.2 . Insert 100,000,000 identical documents, each approximately 300 bytes. Then run the following two queries, and monitor memory consumption: // This simple collection scan query should warm the cache, and thus should end up resulting in ~0.2GB of memory used. db.coll.find().itcount(); // In contrast, this query also needs to scan the collection. But it ends up using ~1 GB of memory, indicating that the system is unnecessarily consuming lots of memory outside the WT cache. db.coll.aggregate([{$match: {nonExistent: {$exists: false }}}, {$group: {_id: null , count: {$sum: 1}}}]).toArray();
Sprint:
Query 2020-02-24
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

During query execution, when documents pass between the PlanStage tree and the pipeline of DocumentSources, they are first buffered in batches using a std::deque by the $cursor stage. The size of the batches is controlled by the internalDocumentSourceCursorBatchSizeBytes setParameter, which defaults to 4MB.

For count-like aggregation queries, this 4MB limit is not respected, leading to unbounded memory consumption. See the repro steps below for an example "count-like" query. In this query, the aggregation pipeline is responsible only for counting documents and does not actually require any of the data fields to be propagated from the PlanStage tree to the DocumentSource pipeline. This is implemented by pushing empty Documents onto the $cursor stage's std::deque. When the memory accounting code attempts to incorporate the size of these empty Documents, it calls Document::getApproximateSize(). This ends up having no effect, because Document::getApproximateSize() returns 0 for empty Documents. As a result, the std::deque of empty Document is allowed to grow without bound. In the repro described below, the deque becomes millions of elements long and consumes close to 1GB of memory.

In order to fix this we could explore a few approaches:

Fix the memory accounting code to include the size of the Document itself, not just the DocumentStorage. Also account for any additional memory consumed by the std::deque.
Change how count-like aggregates execute to avoid creating a large deque of empty documents. Theoretically, this buffering is unnecessary. We could simply discard a matching document and simultaneously increment the counter inside the $sum accumulator.

Assignee:: David Storch
Reporter:: David Storch
Participants:: David Storch, Githook User
Votes:: 0 Vote for this issue
Watchers:: 10 Start watching this issue

Created:: Jan 08 2020 06:00:01 PM UTC
Updated:: Oct 29 2023 10:13:34 PM UTC
Resolved:: Feb 19 2020 11:52:47 PM UTC
Confidence Status Last Update:: 12/Feb/20 11:16 PM

Details

Description

Attachments

Activity

People

Dates