[SERVER-80764] [CQF] Sample in chunks Created: 05/Sep/23  Updated: 29/Oct/23  Resolved: 03/Oct/23

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: 7.2.0-rc0

Type: Improvement Priority: Major - P3
Reporter: Svilen Mihaylov (Inactive) Assignee: Daniel Segel
Resolution: Fixed Votes: 0
Labels: M1
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: File sample_chunks.js    
Backwards Compatibility: Fully Compatible
Participants:

 Description   

For sampling queries, do not perform a fully randomized sample, but instead for example perform 5 random contiguous chunks for 200 consecutive documents each.



 Comments   
Comment by Githook User [ 02/Oct/23 ]

Author:

{'name': 'Daniel Segel', 'email': 'daniel.segel@mongodb.com', 'username': 'dhsegel'}

Message: SERVER-80764 Sample in chunks ABT modification
Branch: master
https://github.com/mongodb/mongo/commit/126cb73bee9d631d47d7f79d4c31419168ac0d10

Comment by Daniel Segel [ 02/Oct/23 ]

Testing accuracy using:
sample_chunks.js
Results:
https://docs.google.com/spreadsheets/d/1e15vDoMjwjB_n7_sXTM_3YbqlXase043LZvk3uinCwE/edit?usp=sharing

Generated at Thu Feb 08 06:44:31 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.