Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 9.0.0-rc0
Affects Version/s: None
Component/s: None
Labels:
- M1
- ce_accuracy

Assigned Teams:

Query Optimization
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Steps To Reproduce:

Hide

See body

Show
See body
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

In ~~SERVER-98102~~, the cost formula was apparently changed so that, in the presence of index {a: 1} and index {a:1, b:1} the right index will be chosen based on cost.

However, if the cardinality estimate is zero, both of those indexes continue to have the same almost-zero cost. This means that the first enumerated index will be picked, but the plan enumerator enumerates indexes based on their order of creation, which can change e.g. because of a mongorestore.

Both indexes may get a cardinality estimate of zero due to the value not being present in the sample, but the indexes are not equivalent – one may perfectly match the predicate while the other may only match it partially. They should not have the same cost.

Consider this example:

db.adminCommand({setParameter: 1, featureFlagCostBasedRanker: true});
db.adminCommand({setParameter: 1, internalQueryCBRCEMode: "samplingCE"});

db.coll.drop();
db.coll.insert({c:1});
db.coll.createIndexes([{a:1}, {a:1, b:1}]);
db.coll.getIndexes();
db.coll.find({a:1, b:1}, {_id: 0, a: 1, b:1}).explain().queryPlanner.winningPlan;

db.coll.drop();
db.coll.insert({c:1});
db.coll.createIndexes([{a:1, b:1}, {a:1}]);
db.coll.getIndexes();
db.coll.find({a:1,b:1}, {_id: 0, a:1 , b:1}).explain().queryPlanner.winningPlan;

Depending on the order in which the indexes were created, the plan can be either:

{
  isCached: false,
  usedJoinOptimization: false,
  stage: 'PROJECTION_SIMPLE',
  costEstimate: 0.00001167,
  cardinalityEstimate: 0,
  estimatesMetadata: { ceSource: 'Metadata' },
  transformBy: { _id: 0, a: 1, b: 1 },
  inputStage: {
    usedJoinOptimization: false,
    stage: 'FETCH',
    costEstimate: 0.00001167,
    cardinalityEstimate: 0,
    estimatesMetadata: { ceSource: 'Metadata' },
    filter: { b: { '$eq': 1 } },
    nss: 'test.coll',
    inputStage: {
      usedJoinOptimization: false,
      stage: 'IXSCAN',
      costEstimate: 0.00001167,
      cardinalityEstimate: 0,
      estimatesMetadata: { ceSource: 'Metadata' },
      numKeysEstimate: 0,
      nss: 'test.coll',
      keyPattern: { a: 1 },
      indexName: 'a_1',
      isMultiKey: false,
      multiKeyPaths: { a: [] },
      isUnique: false,
      isSparse: false,
      isPartial: false,
      indexVersion: 2,
      direction: 'forward',
      indexBounds: { a: [ '[1, 1]' ] }
    }
  }
}

{
  isCached: false,
  usedJoinOptimization: false,
  stage: 'PROJECTION_COVERED',
  costEstimate: 0.00001167,
  cardinalityEstimate: 0,
  estimatesMetadata: { ceSource: 'Sampling' },
  transformBy: { _id: 0, a: 1, b: 1 },
  inputStage: {
    usedJoinOptimization: false,
    stage: 'IXSCAN',
    costEstimate: 0.00001167,
    cardinalityEstimate: 0,
    estimatesMetadata: { ceSource: 'Sampling' },
    numKeysEstimate: 0,
    nss: 'test.coll',
    keyPattern: { a: 1, b: 1 },
    indexName: 'a_1_b_1',
    isMultiKey: false,
    multiKeyPaths: { a: [], b: [] },
    isUnique: false,
    isSparse: false,
    isPartial: false,
    indexVersion: 2,
    direction: 'forward',
    indexBounds: { a: [ '[1, 1]' ], b: [ '[1, 1]' ] }
  }
}

Those plans may have the same estimate and the same cost, but they are in no shape or form equivalent in performance – one does a covering index scan, the other uses an IXSCAN on a potentially inferior index, followed by a FETCH. In the case the estimate was stale or incorrect, those two plans will have vastly different performance characteristics.

is depended on by

TOOLS-4157 Make mongorestore create indexes in deterministic order

Accepted

is duplicated by

SERVER-121388 Join optimization: join enumeration and plan selection should not use non-deterministic data structures

Closed

is related to

SERVER-98102 Take account the number of fields in the index when calculating the cost

Closed

SERVER-124954 Handle cardinalityEstimate of zero in the join optimizer

Closed

SERVER-97933 Resolve cost ties in a deterministic way

Needs Scheduling

related to

SERVER-121388 Join optimization: join enumeration and plan selection should not use non-deterministic data structures

Closed

SERVER-98102 Take account the number of fields in the index when calculating the cost

Closed

SERVER-125783 Preserve prefix-selectivity ordering in CardinalityEstimator::clampZeroEstimates

Needs Scheduling

(3 related to)

Assignee:: Timour Katchaounov
Reporter:: Philip Stoev
Participants:: Githook User, Philip Stoev, Timour Katchaounov
Votes:: 0 Vote for this issue
Watchers:: 5 Start watching this issue

Created:: Mar 31 2026 12:28:22 PM UTC
Updated:: May 10 2026 06:23:35 PM UTC
Resolved:: May 10 2026 06:23:35 PM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates