Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
- sbe-blocker

Assigned Teams:

Query Execution
Confidence Status:
None
Work Order:
3

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

If we run a simple test with a $group by query with many accumulators, SBE performs worse than classic, and the gap appears to increase as the number of accumulators grows.

For many queries the runtime is dominated by other work besides the accumulators (reading data, evaluating other expressions, etc). In these cases, the "regression" in time spent accumulating may not be visible at all. On the other hand, when running the accumulators is a large fraction of the query runtime, there is a clear difference.

Currently the only way to see this issue is through queries with a large (20+) number of a accumulators. However, when running a time series $group query in SBE, we see similar behavior. This is because with time series, the amount of work done to read each document is relatively small, so the $group-by processing represents a greater fraction of the runtime.

The issue appears to be most severe with $avg, presumably because the SBE implementation decomposes this into two separate accumulators (sum and count).

Assignee:: Kevin Cherkauer (Inactive)
Reporter:: Ian Boros
Participants:: Ian Boros, Kevin Cherkauer
Votes:: 0 Vote for this issue
Watchers:: 7 Start watching this issue

Created:: Oct 04 2023 06:07:39 PM UTC
Updated:: Apr 29 2025 03:16:27 PM UTC
Confidence Status Last Update:: 27/Mar/25 6:39 PM

Details

Description

Attachments

Activity

People

Dates