-
Type: Improvement
-
Resolution: Unresolved
-
Priority: Major - P3
-
None
-
Affects Version/s: None
-
Component/s: Aggregation Framework
-
Query Execution
Aggregation should bound total memory usage for each pipeline, not like each individual stage. Using multiple stages that can each use up to 100MBs can result in overall very large memory usage.
Original Description:
The agg framework should bound all memory usage, not just the sort stage. Using very large amounts of data in $group stage, for instance, can result in 26G of non-mapped virtual memory for a single agg job.