Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Fixed
Priority: Minor - P4
Fix Version/s: 7.0.0-rc0
Affects Version/s: None
Component/s: Query Execution, Query Planning
Labels:
- pm2697-m2

Backwards Compatibility:
Fully Compatible
Sprint:
QE 2023-02-06, QE 2023-02-20, QE 2023-03-06
Story Points:
10
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

This improvement aims to improve performance for queries affected by ~~SERVER-62150~~. I recommend reviewing related ticket ~~SERVER-62150~~ for context.

Today, the SBE multi-planner works in the following way. Let's assume there are n candidate plans, and they have some arbitrary order. The algorithm, in pseudocode, looks like this:

maxReads <- internalQueryPlanEvaluationWorksSbe // Defaults to 10,000.
for (plan in plans) {
     numReadsDoneByPlan <- runTrialPeriod(plan, maxReads);
     maxReads <- min(maxReads, numReadsDoneByPlan);
}

The reads budget for the trial period is initialized to 10,000, the default value for the internalQueryPlanEvaluationWorksSbe query knob. Next, we run the trial period for each plan in turn (without any interleaving or round-robin execution). The trial period for a given plan ends if either 1) the plan reaches EOF, 2) the plan produces its first batch of results, or 3) the reads budget is expended. In cases (1) and (2), the number of reads required to reach either EOF or the first batch of results is returned as numReadsDoneByPlan in the pseudocode above. This becomes the new reads budget, which means that no plan's trial period is allowed to last longer than that of the previous plan.

The problem with this approach is that the overall work done during the trial period is very much dependent on the order in which the candidates are trialed. Let's say there is a bad plan which requires all 10,000 reads during the trial period and a good plan which reaches EOF in 10 reads. If the bad plan runs first, then the total reads during the trial period will be 10,000 + 10 = 10,010. If the good plan runs first, it immediately reduces the reads budget to 10, and the trial period costs just 10 + 10 = 20 reads in total.

This ticket proposes an alternative algorithm which aims to reduce the average cost of SBE multi-planning. Here is the updated pseudocode:

maxReads = internalQueryPlanEvaluationWorksSbe // Defaults to 10,000.
PriorityQueue q
q <- all candidates, initialized with the same default priority
while (q is not empty) {
    planToRun <- q.Pop()
    done, numReadsDoneByPlan, newPriority <- planToRun.GetNext()
    if (not done) {
        q.Push(planToRun, newPriority)
    } else {
        maxReads = min(maxReads, numReadsDoneByPlan)
    }
}

Here, the idea is to interleave execution of the candidate plans (which brings the SBE multi-planner a bit closer to the round-robin strategy employed by the classic multi-planner). We call getNext() just once on whichever plan is currently prioritized. The calculation of the priority is not spelled out in the pseudocode, but I imagine that the highest priority plan is the one which currently has the highest "productivity ratio": namely, the highest ratio of documents returned so far to storage reads. As in the current SBE multi-planning algorithm, when a plan's trial period completes, the reads budget is reduced for all plans which remain in the priority queue.

Credit to mihai.andrei for proposing this idea!

is related to

SERVER-62150 SBE Multiplanning can be slow when suboptimal plan runs first

Closed

SERVER-62981 Make SBE multi-planner's trial period termination condition independent of collection size

Closed

SERVER-63642 Add serverStatus metrics to measure multi-planning performance

Closed

related to

SERVER-78077 the log of getProductivityFormula function print inconsistency.

Closed

SERVER-63642 Add serverStatus metrics to measure multi-planning performance

Closed

Assignee:: Ivan Fefer
Reporter:: David Storch
Participants:: Ana Meza, David Storch, Githook User, Ivan Fefer, Mihai Andrei
Votes:: 0 Vote for this issue
Watchers:: 18 Start watching this issue

Created:: Feb 14 2022 10:56:54 PM UTC
Updated:: Oct 29 2023 09:42:30 PM UTC
Resolved:: Feb 28 2023 10:56:16 AM UTC
Confidence Status Last Update:: 16/Feb/23 4:36 PM

Details

Description

Attachments

Issue Links

Forms

Activity

People

Dates