[SERVER-80733] [CQF] Sample two fields at a time Created: 05/Sep/23  Updated: 10/Nov/23  Resolved: 07/Nov/23

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: 7.3.0-rc0

Type: Improvement Priority: Major - P3
Reporter: Svilen Mihaylov (Inactive) Assignee: Daniel Segel
Resolution: Fixed Votes: 0
Labels: M1
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Assigned Teams:
Query Optimization
Backwards Compatibility: Fully Compatible
Sprint: QO 2023-10-16, QO 2023-10-30, QO 2023-11-13
Participants:

 Description   

For sargable nodes, identify promising pairs of fields and sample them together.
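
One way to read "promising pairs" is the heuristic named in the commit message for this ticket: prefer the pair of fields that appears together most often. The sketch below is only an illustration of that idea, not the server's implementation. The function name `most_common_pair` and the representation of each sargable node as a list of field names are assumptions made for this example.

```python
from collections import Counter
from itertools import combinations


def most_common_pair(conjunctions):
    """Return the pair of fields that co-occurs in the most conjunctions.

    `conjunctions` is a hypothetical stand-in for sargable nodes: each entry
    is the list of field names constrained by one node.
    Returns None if no node constrains two or more distinct fields.
    """
    counts = Counter()
    for fields in conjunctions:
        # Sort so that ("a", "b") and ("b", "a") count as the same pair.
        for pair in combinations(sorted(set(fields)), 2):
            counts[pair] += 1
    if not counts:
        return None
    return counts.most_common(1)[0][0]


# Hypothetical predicates: "a" and "b" appear together twice.
predicates = [["a", "b"], ["a", "b", "c"], ["c", "d"], ["a"]]
best = most_common_pair(predicates)
```

Here `best` is `("a", "b")`, so those two fields would be the candidates to sample together.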



 Comments   
Comment by Githook User [ 07/Nov/23 ]

Author:

{'name': 'Daniel Segel', 'email': 'daniel.segel@mongodb.com', 'username': 'dhsegel'}

Message: SERVER-80733 Sample two fields at a time using most common fields or most common pair
Branch: master
https://github.com/mongodb/mongo/commit/3123d4df5d434fbe71e8e36662f2acd098c3ed1f

Comment by Peter Volk [ 29/Sep/23 ]

When doing this, you will need to consider whether the two fields are statistically independent. Sampling each field independently and sampling the two fields together can give different results if the fields are correlated with each other. Someone with a strong background in statistics and probability theory should verify the approach before it is implemented, as it can have a huge impact on the accuracy of the CE.
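
A small worked example of the concern above, using made-up data: when two fields are perfectly correlated, multiplying per-field selectivities (which assumes independence) underestimates the selectivity measured on the same sampled documents. The sample, the field names `a` and `b`, and the helper `selectivity` are all hypothetical and do not reflect the server's estimator.

```python
from fractions import Fraction

# Hypothetical sample: "a" and "b" always hold the same value (fully correlated).
sample = [{"a": i % 2, "b": i % 2} for i in range(100)]


def selectivity(docs, predicate):
    """Fraction of sampled documents that satisfy the predicate."""
    return Fraction(sum(1 for d in docs if predicate(d)), len(docs))


# Per-field estimates, combined under an independence assumption.
sel_a = selectivity(sample, lambda d: d["a"] == 1)
sel_b = selectivity(sample, lambda d: d["b"] == 1)
independent_estimate = sel_a * sel_b  # 1/2 * 1/2 = 1/4

# Joint estimate: both predicates evaluated on the same sampled documents.
joint_estimate = selectivity(sample, lambda d: d["a"] == 1 and d["b"] == 1)  # 1/2
```

In this constructed case the independence-based estimate is 1/4 while the joint estimate is 1/2, a factor-of-two difference in the estimated cardinality. For uncorrelated fields the two estimates would be expected to agree up to sampling error.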

Generated at Thu Feb 08 06:44:26 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.