Type: Improvement
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 9.0.0-rc0
Affects Version/s: None
Component/s: None
Labels: None
Assigned Teams: Query Optimization
Backwards Compatibility: Fully Compatible
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

In ~~SERVER-121256~~, we lowered the random I/O estimate for INLJ to equal the number of probes performed (previously, the estimate was the number of documents returned). This matters because both collections and index entries in MongoDB are clustered by RID: when fetching all records for a single index key, the query engine fetches them in RID order, which produces a "sorted and sparse" access pattern rather than a truly random I/O access pattern.
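To illustrate what "sorted and sparse" means here, the following is a minimal sketch under simplifying assumptions, not the storage engine's actual behavior: RIDs are modeled as plain integers, and a hypothetical `docs_per_page` constant maps each RID to a page by integer division. The helper `pages_visited` is invented for this illustration.

```python
def pages_visited(rids, docs_per_page):
    """Return page numbers in visit order, collapsing consecutive repeats.

    Assumption for illustration only: page = rid // docs_per_page.
    """
    visited = []
    for rid in rids:
        page = rid // docs_per_page
        if not visited or visited[-1] != page:
            visited.append(page)
    return visited


def is_forward_only(pages):
    """True if every page access moves forward through the collection."""
    return all(a < b for a, b in zip(pages, pages[1:]))


DOCS_PER_PAGE = 100  # hypothetical value

# RIDs matching one index key, fetched in RID order (sorted, with gaps).
sorted_rids = [3, 250, 251, 980, 4100]
# The same RIDs fetched in an arbitrary order, as truly random I/O would be.
shuffled_rids = [980, 3, 4100, 250, 251]

sorted_pages = pages_visited(sorted_rids, DOCS_PER_PAGE)
shuffled_pages = pages_visited(shuffled_rids, DOCS_PER_PAGE)
```

In this toy model the sorted fetch touches the same pages as the shuffled one, but only ever moves forward through the collection, which is why it is cheaper than random I/O yet still not free.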

However, the join cost model currently ignores the cost of this sorted-sparse I/O. In the PR, we found a regression in Q7 of TPC-H (note that this version excludes nation2 from the join graph): we started choosing an INLJ for the supplier-lineitem join instead of an HJ. The execution stats showed that the INLJ read the entire lineitem collection, which is clearly worse than performing an HJ. The LHS of the INLJ is HJ(nation, supplier), which returns 1000 documents, so our cost model estimates ~1000 random I/Os, but in reality these probes together return all 600k lineitems.

Once the cost model accounts for sorted-sparse I/O, we expect Q7 to choose HJ(HJ(N, S), L) instead of INLJ(HJ(N, S), L).
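The effect on plan choice can be sketched with a toy cost comparison. This is not the server's join cost model: the per-I/O weights, the `docs_per_page` value, the page-count cap, and all function names below are assumptions made up for illustration, and the hash join's build and probe CPU costs are ignored. Only the 1000 probes and 600k lineitems come from the ticket.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyCostParams:
    random_io: float = 4.0      # hypothetical cost of one random page read
    sequential_io: float = 1.0  # hypothetical cost of one forward page read
    docs_per_page: int = 100    # hypothetical documents per page


def collection_pages(num_docs, params):
    return math.ceil(num_docs / params.docs_per_page)


def inlj_cost_probes_only(num_probes, params):
    # One random I/O per probe; fetched documents cost nothing.
    return num_probes * params.random_io


def inlj_cost_with_sorted_sparse(num_probes, docs_fetched, inner_docs, params):
    # Adds a forward-read term: at most one page per fetched document,
    # capped by the number of pages in the inner collection.
    touched = min(docs_fetched, collection_pages(inner_docs, params))
    return num_probes * params.random_io + touched * params.sequential_io


def hj_cost(inner_docs, params):
    # Toy hash join: one sequential scan of the inner collection.
    return collection_pages(inner_docs, params) * params.sequential_io


params = ToyCostParams()
probes, fetched, lineitems = 1000, 600_000, 600_000

before = inlj_cost_probes_only(probes, params)
after = inlj_cost_with_sorted_sparse(probes, fetched, lineitems, params)
hash_join = hj_cost(lineitems, params)
```

With these made-up constants, the probes-only estimate undercuts the hash join, while the estimate that includes the sorted-sparse term does not, mirroring the switch from INLJ to HJ described above.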

is related to

SERVER-121256 [Join Optimization] INLJ costing - don't consider every probed document as a random IO (Closed)

related to

SERVER-125042 TPC-H near-optimal plan investigation on SF 1 (Open)
SERVER-125415 Investigate TPC-H plan quality when data does not fit in cache (Backlog)
SERVER-122916 [Join Optimization] Index scan costing - don't consider each fetched document a random IO (Closed)
SERVER-124954 Handle cardinalityEstimate of zero in the join optimizer (Closed)

Assignee: Max Verbinnen
Reporter: Ben Shteinfeld
Participants: Ben Shteinfeld, Githook User, Max Verbinnen
Votes: 0
Watchers: 3

Created: Mar 20 2026 02:47:07 PM UTC
Updated: Apr 29 2026 05:25:40 PM UTC
Resolved: Apr 29 2026 05:25:40 PM UTC