Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.1.0-rc0
Affects Version/s: None
Component/s: None
Labels:
- M4

Assigned Teams:

Query Optimization
Backwards Compatibility:
Fully Compatible
Sprint:
QO 2025-02-03, QO 2025-02-17
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

When we issue a distinct() command, we always try to dedup the results we get back from the cursor inside the command logic, regardless of whether we already have duplicates or not (i.e. we could sometimes just directly return the output of DISTINCT_SCAN). Conversely, when we have an aggregation, the $groupByDistinct rewrite eliminates this excess work when we generate a DISTINCT_SCAN.

For this reason, we need to prioritize plans that use a DISTINCT_SCAN more highly than plans that don't when we use aggregations than when we use distinct() commands. This is because two plans may have an equivalent productivity measure, but NOT have similar work done on the output of the cursor. We may also want to consider eliminating index scan candidates completely for aggregations.

Assignee:: Alya Berciu
Reporter:: Alya Berciu
Participants:: Alya Berciu, Githook User
Votes:: 0 Vote for this issue
Watchers:: 2 Start watching this issue

Created:: Jan 20 2025 01:03:59 PM UTC
Updated:: Feb 05 2025 02:29:28 PM UTC
Resolved:: Feb 05 2025 02:26:52 PM UTC
Confidence Status Last Update:: 22/Jan/25 12:47 PM

Details

Description

Attachments

Forms

Activity

People

Dates