Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Aggregation Framework
Labels:
None

Assigned Teams:

Query Optimization
Sprint:
QO 2022-09-05, QO 2022-10-03, QE 2022-10-17

When generating pipelines to handle polymorphic data, the BI-Connector creates a lot of fields that are not necessarily needed for the final output, as a simplified example:

{$addFields: {x: ..., y:..., z:...}}
{$project{OUT: "$x"}}

The code for computing y and z, which can be quite expensive, is totally unneeded. This can generalize to any number of stages. Essentially, any fields removed by $projects can be removed from computations proceeding said $project, transitively, e.g.:

{$addFields: {x: ..., y:..., z:...}}
{$addFields: {a: {$add: ["$x", "$y"]}, b: {$add: ["$y", "$z"]}}}
{$project{OUT: "$b"}}

Here, we can remove the computation for a, and then, transitively, x. This has the possibility to drastically improve many types of queries, but will require a field level dependency tracker. This will see benefit for any generated code, not just from the BI-Connector, so it makes more sense to be done within the server.

is related to

SERVER-69361 [CQF] Extend path fusion to better handle field dependencies

Closed

related to

SERVER-69361 [CQF] Extend path fusion to better handle field dependencies

Closed

Assignee:: [DO NOT USE] Backlog - Query Optimization

Reporter:: Patrick Meredith

Participants:: [DO NOT USE] Backlog - Query Optimization, Patrick Meredith, Svilen Mihaylov

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Created:: Nov 15 2018 03:58:58 PM UTC

Updated:: Dec 09 2022 09:08:28 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates