Type: Improvement
Resolution: Fixed
Priority: Minor - P4
Fix Version/s: 8.1.0-rc0
Affects Version/s: None
Component/s: None
Labels:

Assigned Teams:
Query Optimization
Backwards Compatibility:
Fully Compatible
Linked BF Score:
200
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The DISTINCT_SCAN optimization can only be triggered if the entire pipeline and the indexes have a specific shape and reference specific fields in a specific order.

This can be used as an exercise to construct such pipelines in property-based testing (PBT):

generate a list of field names
create indexes over said fields
$sort over that same list of fields
optionally $match over that same list of fields
use the first few fields for the _id field of the $group, and the remainder for the actual accumulators
add additional stages, e.g. $project, that also use the initial list of fields
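
The steps above can be sketched roughly as follows. This is only an illustration in Python, not the actual test code: the names build_case and random_case, the ascending-only index, the {"$gte": 0} predicate and the choice of $first as the accumulator are all assumptions made for the example.

```python
import random


def build_case(fields, group_key_len, with_match=False):
    """Derive an index spec and a pipeline from one ordered list of field names.

    Hypothetical helper: the ticket only describes the shape, not this API.
    """
    if not 0 < group_key_len <= len(fields):
        raise ValueError("group_key_len must be between 1 and len(fields)")

    # Compound index over the fields, in list order (ascending is an assumption).
    index_spec = [(f, 1) for f in fields]

    pipeline = []
    if with_match:
        # Optional $match over the same fields; the range predicate is an assumption.
        pipeline.append({"$match": {f: {"$gte": 0} for f in fields}})

    # $sort over the same fields, in the same order as the index.
    pipeline.append({"$sort": {f: 1 for f in fields}})

    # The first few fields form the $group _id; the remainder feed accumulators.
    key_fields = fields[:group_key_len]
    acc_fields = fields[group_key_len:]
    group = {"_id": {f: f"${f}" for f in key_fields}}
    for f in acc_fields:
        group[f"first_{f}"] = {"$first": f"${f}"}  # $first is an assumption
    pipeline.append({"$group": group})

    return index_spec, pipeline


def random_case(rng):
    """Pick a random field list and split point, then build the case."""
    n = rng.randint(1, 4)
    fields = [f"f{i}" for i in range(n)]
    rng.shuffle(fields)
    return build_case(fields, rng.randint(1, n), with_match=rng.random() < 0.5)
```

For example, random_case(random.Random(0)) returns one (index_spec, pipeline) pair; a PBT framework would normally supply the randomness and shrinking instead of random.Random.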

Then enforce the property that the indexed plan returns the same results as the collection scan. Because the $sort and the $group are generated from the same input list of field names, the output will be deterministic.
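
A minimal sketch of that property check, assuming a hypothetical run_pipeline(pipeline, hint) callable that executes the aggregation with the given hint and returns the result documents (for instance, a thin wrapper around a driver's aggregate call). The {"$natural": 1} hint is used here to request a collection scan; plans_agree and canonical are illustrative names, not part of any existing test.

```python
import json


def canonical(docs):
    """Order-insensitive form of a result set, since $group output order is not guaranteed."""
    return sorted(json.dumps(d, sort_keys=True, default=str) for d in docs)


def plans_agree(run_pipeline, pipeline, index_hint):
    """Return True if the indexed plan and the collection scan yield the same documents.

    run_pipeline is a hypothetical callable: (pipeline, hint) -> iterable of dicts.
    """
    indexed = run_pipeline(pipeline, index_hint)
    scanned = run_pipeline(pipeline, {"$natural": 1})  # force a collection scan
    return canonical(indexed) == canonical(scanned)
```

Passing the runner in as a parameter keeps the comparison logic testable without a running server; in a real test the runner would talk to the database.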

This can be further extended by adding more stages, e.g. $project.

is duplicated by

SERVER-72750 Grammar fuzzer changes - exercise DISTINCT_SCAN

Closed

is related to

SERVER-97668 Targeted property-based test (PBT) for explode-for-sort optimization

Backlog

Assignee: Ben Shteinfeld
Reporter: Philip Stoev
Participants: Ben Shteinfeld, Githook User, Philip Stoev
Votes: 0
Watchers: 7

Created: Sep 06 2024 12:45:40 PM UTC
Updated: Nov 26 2024 10:20:55 PM UTC
Resolved: Nov 18 2024 02:36:40 PM UTC
Confidence Status Last Update: 01/Oct/24 3:36 PM
