Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Gone away
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 4.0.6
Component/s: Index Maintenance, Querying
Labels:
- query-44-grooming

Assigned Teams:

Query Optimization
Confidence Status:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

In the following schema:

{
    "arr" : [ 
        "a", 
        "b", 
        "c"
    ],
    "field" : 1,
    "field_2" : 1
}

with the following index:

{
    "arr" : 1,
    "field" : 1,
    "field_2" : 1
}

If I use this query:

db.coll
    .find({
        $or: [
            {arr: 'a'},
            {arr: []}
        ]
    })
    .sort({field: 1})
    .explain()

The inputStages are separated to 3 stages - arr: [a, a], arr: [[], []], arr: [undefined, undefined].
This is a good scenario since these input stages are followed by a "SORT_MERGE" stage

BUT, if I use the following query:

db.coll
    .find({
        $or: [
            {arr: 'a'},
            {arr: []}
        ],
        field: 1
    })
    .sort({field_2: 1})
    .explain()

There are only 2 input stages - arr: [a, a], arr: [[undefined, undefined], [[] , []]] .
This results that an additional stage needs to happen in order to FETCH the empty array docs and then it cannot use the SORT_MERGE stage.

In large collection, this causes the following error:
"Sort operation used more than the maximum 33554432 bytes of RAM. Add an index, or specify a smaller limit
Even though there is an index for this query.

I would expect the second scenario to perform like the first one - separating the input stages to 3 stages and to use the SORT_MERGE function.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

good_scenario.js
6 kB
May 14 2019 04:04:13 PM UTC
bad_scenario.js
5 kB
May 14 2019 04:04:14 PM UTC

is related to

SERVER-24518 MERGE_SORT_STAGE can be used more aggressively when OR_STAGE index sort orders match

Backlog

SERVER-19972 Passing empty array to $in should result in an error

Closed

Assignee:: [DO NOT USE] Backlog - Query Optimization

Reporter:: Tom Grossman

Participants:: [DO NOT USE] Backlog - Query Optimization, Asya Kamsky, Chris Harris, Eric Sedor, Tom Grossman

Votes:: 0 Vote for this issue

Watchers:: 8 Start watching this issue

Created:: May 14 2019 04:02:05 PM UTC

Updated:: Jul 03 2024 04:21:09 PM UTC

Resolved:: Jul 03 2024 04:21:09 PM UTC

GA Target Date:: None

Public Preview Target Date:: None

Private Preview Target Date:: None

Experiment Target Date:: None

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates