Loading...

XML

Word

Printable

JSON

Type: New Feature
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: 3.4.3
Component/s: Aggregation Framework
Labels:
None

Assigned Teams:

Query Optimization
Case:
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

The Spark connector uses the Aggregation Framework to create data partitions that are sent to Spark workers. In a sharded cluster it makes sense to align these partitions to chunk boundaries so that each worker's data loading query is targeted to a single shard.

This however is impossible when the shard key is a hashed index. A simple find can use $min / $max but there is no comparable facility in the aggregation framework.

is related to

SPARK-98 MongoShardedPartitioner and hashed shard keys not working correctly

Closed

SERVER-14400 Using $min and $max on shard key doesn't target queries

Backlog

related to

SERVER-24274 Create a command to provide query bounds for partitioning data in a collection

Backlog

Assignee:: [DO NOT USE] Backlog - Query Optimization
Reporter:: Sylvain Chambon (Inactive)
Participants:: [DO NOT USE] Backlog - Query Optimization, Andrew Ryder, Kelsey Schubert, Sylvain Chambon
Votes:: 5 Vote for this issue
Watchers:: 16 Start watching this issue

Created:: Apr 07 2017 11:21:35 AM UTC
Updated:: Dec 06 2022 04:03:59 AM UTC

Details

Description

Attachments

Issue Links

Activity

People

Dates