-
Type: Improvement
-
Resolution: Fixed
-
Priority: Major - P3
-
Affects Version/s: None
-
Component/s: Query Execution, Query Planning
-
None
-
Minor Change
-
QE 2022-04-04, QE 2022-02-21, QE 2022-03-07, QE 2022-03-21
SERVER-62150 describes a scenario where SBE multi-planning can be slow relative to the classic engine's multi-planning implementation. We implemented SERVER-62981 in order to mitigate this issue, and also have proposed SERVER-63641 as an additional improvement. In order to make sure that customers are experiencing good SBE multi-planner performance, we should add metrics to serverStatus. Before implementing this ticket, we need to agree on exactly what metrics to capture and how they will be exposed in serverStatus. The current proposal is to collect histograms of both the number of storage reads performed during SBE multi-planning and the overall wall clock time spent multi-planning.
We may wish to collect similar information for the classic multi-planner as well as the SBE multi-planner. There are known scenarios in which the classic multi-planner can take a long time to complete. In particular, see SERVER-31078.
The intended audience of these metrics is query engineering and query product management. We want to be able to analyze the performance of multi-planning across the Atlas fleet in order to inform our decision making about future improvements to the server. It's probable that these metrics would also be useful in support scenarios (e.g. seeing if a customer is getting a lot of queries which take a long time to multi-plan), but this is not the primary use case.
- causes
-
SERVER-65984 Move ServerStatusMetricField specialization out of histogram.h
- Closed
- depends on
-
SERVER-65271 serverStatus should allow fine-grained metrics exclusion
- Closed
- is related to
-
SERVER-62150 SBE Multiplanning can be slow when suboptimal plan runs first
- Closed
-
SERVER-31078 Query planning is very slow during multiplanning when result set is empty
- Backlog
-
SERVER-62981 Make SBE multi-planner's trial period termination condition independent of collection size
- Closed
-
SERVER-63015 Capture metrics about time spent multiplanning in slow query logs
- Closed
-
SERVER-63641 Improve SBE multi-planning by choosing which plan to work next based on a priority metric
- Closed
- related to
-
SERVER-63641 Improve SBE multi-planning by choosing which plan to work next based on a priority metric
- Closed