[SERVER-85538] Improve the performance of findUnsplitableCollections in move_unsharded_policy Created: 22/Jan/24  Updated: 25/Jan/24

Status: Backlog
Project: Core Server
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Improvement Priority: Major - P3
Reporter: David Dominguez Sal Assignee: Backlog - Catalog and Routing
Resolution: Unresolved Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Issue Links:
Related
is related to SERVER-84094 Implement initial drain policy for un... Backlog
Assigned Teams:
Catalog and Routing
Participants:

 Description   

The current implementation performs the location of unsplitable collections in two steps. First, it gets all the chunks, and then it validates which chunk belongs to an unsplitable database. Since the current balancer issues unsplitable migrations one by one, the system must do a linear operation on the chunks to get each migration. 

It would be better to do an aggregation that is aware of the limit, and does not perform a full scan of the chunks before starting the join with collections (i.e. not hash joined based)  


Generated at Thu Feb 08 06:57:59 UTC 2024 using Jira 9.7.1#970001-sha1:2222b88b221c4928ef0de3161136cc90c8356a66.