Type: Task
Resolution: Done
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: MapReduce, Sharding
Labels: remove-distributed-lock-fallout
Assigned Teams: Sharding
Confidence Status: None
Work Order: 3
CAR Domain/s: None
Aha! Reference: None
Tracking Level: None
Risk Status: None
Exec Notes: None
Goal Name(s): None
Goal Link: None

Map/reduce with a sharded output collection performs an optimization in which the final reduce step is omitted and the previous reduce step is done in parallel on all shards. This is done by creating an empty sharded collection and spreading its chunks so that they are co-located with the data to be reduced, based on the shard key. Each shard then writes all of its output locally to an empty temporary collection, which is finally renamed to the name of the output collection.

This process only works if no chunks of the output collection move while the output is being written. That invariant is currently protected by holding the collection distributed lock.
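
To make the invariant concrete, here is a minimal in-memory sketch in Python. It is not MongoDB's implementation: the split points, shard names, and the helpers `owner`, `reduce_locally`, and `misplaced` are all hypothetical, and the reduce function is assumed to be a simple sum. It only illustrates that output written locally stays correct while chunk placement is unchanged, and ends up on the wrong shard if a chunk moves mid-write.

```python
from bisect import bisect_right
from collections import defaultdict

# Hypothetical chunk boundaries over an integer shard key:
# chunk 0 = (-inf, 100), chunk 1 = [100, 200), chunk 2 = [200, +inf)
SPLIT_POINTS = [100, 200]


def owner(placement, key):
    """Return the shard that owns the chunk containing `key`.

    `placement` is a list with one shard name per chunk, in chunk order.
    """
    return placement[bisect_right(SPLIT_POINTS, key)]


def reduce_locally(mapped, placement):
    """Reduce (here: sum) each key on the shard that owns its chunk.

    Returns {shard: {key: reduced_value}}, modelling one temporary
    collection per shard.
    """
    temp = defaultdict(dict)
    for key, values in mapped.items():
        temp[owner(placement, key)][key] = sum(values)
    return dict(temp)


def misplaced(temp, placement):
    """Keys whose temporary-collection shard does not own their chunk."""
    return sorted(
        key
        for shard, docs in temp.items()
        for key in docs
        if owner(placement, key) != shard
    )


if __name__ == "__main__":
    placement = ["shardA", "shardB", "shardC"]
    mapped = {5: [1, 2], 150: [3], 250: [4, 5]}

    temp = reduce_locally(mapped, placement)
    print(misplaced(temp, placement))   # no chunk moved during the write

    # Simulate chunk 1 migrating from shardB to shardC during the write.
    moved = ["shardA", "shardC", "shardC"]
    print(misplaced(temp, moved))       # key 150 now lives on the wrong shard
```

In this model, the rename step would be safe only when `misplaced` returns an empty list, which is the condition the distributed lock currently guarantees by preventing chunk movement.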

This task is to get rid of this reliance on the collection distributed lock.

Assignee: [DO NOT USE] Backlog - Sharding Team
Reporter: Kaloian Manassiev
Participants: [DO NOT USE] Backlog - Sharding Team, Asya Kamsky, Kaloian Manassiev, Randolph Tan
Votes: 0
Watchers: 6

Created: Nov 30 2016 10:14:38 PM UTC
Updated: Dec 06 2022 04:10:36 AM UTC
Resolved: Jul 24 2021 06:41:40 AM UTC