Loading...

XML

Word

Printable

JSON

Type: Bug
Resolution: Fixed
Priority: Blocker - P1
Fix Version/s: 7.1.0-rc0, 7.0.0-rc8
Affects Version/s: 6.3.1, 7.0.0-rc7
Component/s: None
Labels:
- auto-reverted

Assigned Teams:

Query Execution
Backwards Compatibility:
Fully Compatible
Operating System:
ALL
Sprint:
QE 2023-07-24
Case:
Linked BF Score:
167
Confidence Status:
None
Work Order:
0
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

After analysis the following root cause is two fold:

1) The Parameterization of a query abstracts dependencies between two $or branches in such a way that dependencies are lost. A query containing an $or statement with predicates on the same field of the same value are considered to be the same as a query where the 2 filters did not have the same value. Both queries would result in the same QueryShape which is semantically incorrect as the dependency are used in query optimizations. These two queries should have two different planhashes

2) While checking for equality of 2 $or branches containing and IndexScan the equality function was missing to check the the IETs and 2 falsely considered equal IndexScans were merged together leading to a correctness issue.

Hi Team,

I have identified an issue with the Slot-based Execution Engine where the Query Planner is selecting the wrong plan for a find() command (see details below), resulting in data inconsistency. In summary, it's implicitly adding a limit(1) to a query, causing the query to always return 1 document instead of the actual number of matched documents (2 expected). Before we found the actual culprit, one workaround was to do a test failover in Atlas also which would resolve the issue temporarily.

Scenario:

We created the corresponding aggregation pipeline for the same find() query, and it returned 2 documents as expected.
If we switch the position of any field in the query filter, it returns 2 documents as expected.
We looked at the PlanCache of the collection and found two queries identical to the problematic one, except that one of them has "limit(1)" which is the one the query planner kept selecting although we had not explicitly used limit in the query. - the culprit for this issue.
Because of this cached plan, the number of returned documents was always 1.
We ran the PlanCache.clearPlansByQuery() command to remove the query, and that solved the issue.

I have also attached the Plan cache output which contains the bad plans along with the new plans (after we cleared the cache). Each section is labeled "old cache query plans" and "new cached query plans" respectively.

Please let me know if you need any additional information. Thank you for your help.

Best regards,
Reginald

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

old-and-new-query-plans.txt
10 kB
Jul 06 2023 10:47:34 PM UTC
summary_oldMethod
31 kB
Jul 19 2023 08:40:23 AM UTC
summary_withNewMethod
29 kB
Jul 19 2023 08:40:23 AM UTC

related to

SERVER-78962 Check on feasibility to perform boolean simplification before parameterization

Backlog

SERVER-79092 Optimize the expression search for parameter reuse during parameterization

Closed

Assignee:: David Storch
Reporter:: Reginald Chounoune
Participants:: David Storch, Githook User, Reginald Chounoune, xgen-buildbaron-user
Votes:: 0 Vote for this issue
Watchers:: 35 Start watching this issue

Created:: Jul 06 2023 10:44:20 PM UTC
Updated:: Oct 29 2023 09:19:08 PM UTC
Resolved:: Jul 20 2023 09:57:11 PM UTC
Confidence Status Last Update:: 19/Jul/23 7:59 PM

Details

Description

Attachments

Attachments

Issue Links

Forms

Activity

People

Dates