Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:

Assigned Teams:

Workload Scheduling
Operating System:
ALL
Sprint:
WS Prioritized List
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

When queries don't yield cooperatively for long periods of time, say due to blocking sorts and groups, execution control does not add tickets fast enough, or at all.

Imagine a workload where queries take longer than 1 second. The default probing interval is 200ms. We will never observe an increase in throughput even though progress is being made, and thus never increase concurrency.

We should consider the following improvements:

Adjust probing interval based on query latency

If the average query latency is 1 second, we should increase the probing interval to a similar order of magnitude so that our feedback loop captures queries as they complete. But as we scale the probing interval, we should proportionately scale the step size so that we increase tickets with the same velocity as a shorter interval.

Unconditionally add tickets when throughput is zero

If throughput is zero but tickets are maxed out, we should just unconditionally add tickets. When this happens, this means we have many high-latency queries in progress. As long as we have room to increase tickets, this should help ramp up tickets in a "cold start" scenario.

is related to

SERVER-86504 Better observability for operations which exceed ticket deadlines

Backlog

Assignee:: Unassigned
Reporter:: Louis Williams
Participants:: Louis Williams
Votes:: 0 Vote for this issue
Watchers:: 13 Start watching this issue

Created:: Mar 01 2024 09:51:00 PM UTC
Updated:: Mar 03 2025 04:33:40 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates