Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: Aggregation Framework
Labels:
None

Assigned Teams:

Query Optimization
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Suppose your run the following aggregation:

db.test.aggregate([
  {$group: {
    _id: {
      date: {$dateToString: {date: "$_id", format: "%Y-%m-%d"}},
      region: "$region"
    },
    total: {$sum: 1}
  }},
  {$project: {_id: 0, region: "$_id.region", date: "$_id.date", total: 1}},
  {$out: {to: "sharded_by_region", mode: "replaceDocuments", uniqueKey: {region: 1, _id: 1}}}
])

Further suppose that the collection "sharded_by_region" has shard key {region: 1}. It looks like this pipeline is eligible for an $exchange optimization because all the way from the $group to the $out the shard key is preserved - it's just renamed from "_id.region" to top-level "region".

Unfortunately, our dependency/rename tracking will not consider this to be a strict rename, because it cannot figure out that "_id" won't be an array. If "_id" were an array, than the $project stage would be doing more than a rename, instead transforming the array previously stored in "_id" and storing the result of the transformation in "region" or "date" accordingly.

This use-case of using a $group with multiple group-by keys seems common enough for us to consider adding custom logic to communicate to the dependency/rename tracking system that we know that either (1) "_id" is not an array or (2) the pipeline will result in an error because the shard key and the uniqueKey cannot contain arrays.

Assignee:: [DO NOT USE] Backlog - Query Optimization
Reporter:: Charlie Swanson
Participants:: [DO NOT USE] Backlog - Query Optimization, Charlie Swanson
Votes:: 0 Vote for this issue
Watchers:: 4 Start watching this issue

Created:: Aug 21 2018 02:30:13 PM UTC
Updated:: Dec 06 2022 03:20:14 AM UTC

Details

Description

Attachments

Forms

Activity

People

Dates