Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Query Optimization
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

For example, consider a FETCH stage with the following filter:

{
  $match: {
    $and: [
      {
        $or: [
          { a: { $nin: [23565, 11, 9708, 4819] } },
          { a: false }
        ]
      },
      {
        b: { $lt: 1.13 },
        c: { $lte: 726 }
      },
      {
        $or: [
          { d: { $type: 'string' } },
          { e: { $regex: '^y' } }
        ]
      }
    ],
    f: { $lte: 1.15 },
    g: { $lte: { s: 1, b: 5 } }
  }
}

The filter expression has 8 leaf expressions. ~~SERVER-113632~~ calibrated the cost of evaluating a single leaf expression. A naive cost model could simply multiply that cost by 8 (the number of leaves). However, that will usually be a huge overestimation of the actual cost because of short circuiting.

When the first child expression of $and evaluates to false, none of the remaining child expressions need to be evaluated. Similarly for $or, once a child expression evaluates to true, none of the other child expressions need to be looked at. The more complex the filter, the unlikelier it becomes that each leaf expression will be evaluated.

The current costing logic only assigns the cost of 1 leaf expression even if there are more, because it turns out better plan choices are made when underestimating the filter cost than when overestimating.

However, there are many possible ways in which we can make a smarter estimation of the actual leaf evaluations. Since we have a selectivity estimate for each predicate, we basically just have to solve the probability problem to find E(number of leaves evaluated) where E is the expected value. We of course don't have perfect information, because we can't perfectly estimate selectivities for conjuncts or disjuncts.

is related to

SERVER-113632 Add incremental cost for every leaf in the filter expression

Closed

Assignee:: Unassigned
Reporter:: Max Verbinnen
Participants:: Max Verbinnen
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Nov 28 2025 05:01:25 PM UTC
Updated:: Dec 01 2025 03:48:10 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates