Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Duplicate
Priority: Critical - P2
Fix Version/s: None
Affects Version/s: None
Component/s: Querying
Labels:
- bonsai
- query-44-grooming

Assigned Teams:

Query Optimization
Confidence Status:
None
Work Order:
0

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

We should consider the number of documents scanned between different plans being evaluated. At the very least, we should use that to resolve ties between plans.

This is critical for Big Data systems reading data via Spark Connector that partitions data by _id for reading. Example:

For the query

{date:{$gte:A}, _id:{$gte:B}, email:{$gte:C}}

all indexes below will tie, although it is beyond obvious which one should be selected (the difference is of course dramatic when we're talking about TB's of data):

{date:1, _id:1}
{date:1, _id:1, email:1}
{date:1, _id:1, some_other_email:1}

is duplicated by

SERVER-79400 Implement number of documents tie breaking heuristics

Closed

related to

SERVER-14423 Plans which fetch different numbers of documents can tie

Closed

Assignee:: [DO NOT USE] Backlog - Query Optimization
Reporter:: Alexander Komyagin (Inactive)
Participants:: [DO NOT USE] Backlog - Query Optimization, Alexander Ignatyev, Alexander Komyagin
Votes:: 2 Vote for this issue
Watchers:: 23 Start watching this issue

Created:: Sep 18 2018 09:25:41 PM UTC
Updated:: Sep 11 2023 10:38:17 AM UTC
Resolved:: Sep 11 2023 10:38:17 AM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates