Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Minor - P4
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Atlas Streams
Confidence Status:
None
Work Order:
3
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

Consider a pipeline with a `hoppingWindow`, like this one:

pipeline: [{
$group:
{ _id:"$customerId", customerDocs: {$push:"$$ROOT"},
}
}]

Say there are 200 open windows. Now, an document will be absorbed into all these open windows. And, though the logical state size is O(200docs), the actual memory usage will still just be 1 doc since documents are cheaply copyable via ref-counting etc

Now, when such a state is checkpointed and recovered, we lose this sharing info and so today we will end up with 200 different docs after the recovery.

This causes ballooning in memory usage after the checkpoint has been recovered.

Assignee:: Unassigned
Reporter:: Mayuresh Kulkarni
Participants:: Mayuresh Kulkarni
Votes:: 0 Vote for this issue
Watchers:: 3 Start watching this issue

Created:: Dec 13 2023 04:33:59 PM UTC
Updated:: Oct 10 2024 04:32:53 PM UTC

Details

Description

Attachments

Activity

People

Dates