Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
- auto-reverted

Assigned Teams:

Query Optimization
Linked BF Score:
0
Confidence Status:
None
Work Order:
3
Size Category:
TBD

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The current approach used by CBR sampling CE to estimate IntervalBounds is to scan the sample and generate keys out of it on each estimation call. This is very inefficient.

A potentially much more efficient approach is to transform each interval into a MatchExpression, and apply the expression directly to the sample. This task is about:

Create equivalent MatchExpression (ME) from an IntervalBounds.
Replace the calls to
SamplingEstimator::estimateRIDs with SamplingEstimator::estimateCardinality.
Benchmark and compare the performance of interval estimation via SamplingEstimator::estimateRIDs as a baseline, and compare that to the new estimation method based on Interval->ME conversion. Benchmarking can be done at least at two levels:
- Microbenchmark estimation itself excluding/including the conversion Interval->ME, and
- Run Genny multi-planner benchmarks (or similar).

is depended on by

SERVER-105980 Performance benchmark for sampling cardinality estimation over index bounds

In Progress

related to

SERVER-105939 Create MatchExpressions for special cases of index bounds intervals

Needs Scheduling

Assignee:: Milena Ivanova
Reporter:: Timour Katchaounov
Participants:: Githook User, Milena Ivanova, Mothra Jira Bot, Timour Katchaounov
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Apr 30 2025 08:45:21 AM UTC
Updated:: Jun 20 2025 03:56:38 PM UTC
Confidence Status Last Update:: 30/Apr/25 10:08 AM

Details

Description

Attachments

Issue Links

Activity

People

Dates