[SERVER-78265] Make analyzeShardKey command support sampling documents Created: 21/Jun/23  Updated: 29/Oct/23  Resolved: 28/Jun/23

Status: Closed
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: 7.1.0-rc0, 7.0.0-rc7

Type: Task Priority: Major - P3
Reporter: Cheahuychou Mao Assignee: Cheahuychou Mao
Resolution: Fixed Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Backports
Related
is related to MONGOSH-1499 Make analyzeShardKey shell helper sup... Closed
Backwards Compatibility: Fully Compatible
Backport Requested:
v7.0
Sprint: Sharding NYC 2023-06-26, Sharding NYC 2023-07-10
Participants:

 Description   

According to the experiments in SERVER-68763, the aggregate command run by the analyzeShardKey command to calculate the metrics about the characteristics about the shard key can take up to hours to run if the collection contains hundreds of millions of documents and the cardinality of the shard key is also very large. Given this, we should make the command support calculating metrics based on sampled documents instead of all of documents in the collection.



 Comments   
Comment by Githook User [ 03/Jul/23 ]

Author:

{'name': 'Cheahuychou Mao', 'email': 'mao.cheahuychou@gmail.com', 'username': 'cheahuychou'}

Message: SERVER-78265 Make analyzeShardKey command support sampling documents
Branch: EVG-17874-taskgen-test
https://github.com/mongodb/mongo/commit/41619ff10085f334c7bcd43ccc1bfcbc35f75360

Comment by Githook User [ 29/Jun/23 ]

Author:

{'name': 'Cheahuychou Mao', 'email': 'mao.cheahuychou@gmail.com', 'username': 'cheahuychou'}

Message: SERVER-78265 Make analyzeShardKey command support sampling documents

(cherry picked from commit 41619ff10085f334c7bcd43ccc1bfcbc35f75360)
Branch: v7.0
https://github.com/mongodb/mongo/commit/d327bd007fc8309af64912c6a8394e7fecdecf14

Comment by Githook User [ 28/Jun/23 ]

Author:

{'name': 'Cheahuychou Mao', 'email': 'mao.cheahuychou@gmail.com', 'username': 'cheahuychou'}

Message: SERVER-78265 Make analyzeShardKey command support sampling documents
Branch: master
https://github.com/mongodb/mongo/commit/41619ff10085f334c7bcd43ccc1bfcbc35f75360

Comment by Githook User [ 27/Jun/23 ]

Author:

{'name': 'Cheahuychou Mao', 'email': 'mao.cheahuychou@gmail.com', 'username': 'cheahuychou'}

Message: SERVER-78265 Move the helpers for the key characteristics tests into util files

(cherry picked from commit 99ca585727ee0fa75c93d62cfaa3f09c2708a88c)
Branch: v7.0
https://github.com/mongodb/mongo/commit/5432b4be086054f61bc9b25d31aa8384e63d35d7

Comment by Githook User [ 23/Jun/23 ]

Author:

{'name': 'Cheahuychou Mao', 'email': 'mao.cheahuychou@gmail.com', 'username': 'cheahuychou'}

Message: SERVER-78265 Move the helpers for the key characteristics tests into util files
Branch: master
https://github.com/mongodb/mongo/commit/99ca585727ee0fa75c93d62cfaa3f09c2708a88c

Generated at Thu Feb 08 06:37:53 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.