Loading...

XML

Word

Printable

JSON

Type: Task
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Query Execution
Confidence Status:
None
Work Order:
0

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name:
None
Goal Link:
None

While spilling in SBE group and lookup stages, we insert record by record to the spill store (each followed by a commit), instead of inserting a batch of records together and reducing the number of IO. Note the classic document sources which uses filesystem for spilling does batching when writing to files.

I tested with a small patch that does batching when spilling in hash_group.cpp and the throughput of the $group queries that spill increases upto 25% (higher improvement when there are many groups and the individual groups are small leading to more spill records)

Assignee:: Unassigned

Reporter:: Projjal Chanda

Participants:: Projjal Chanda

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: Nov 13 2024 05:07:53 PM UTC

Updated:: Apr 07 2025 01:36:22 PM UTC

Details

Description

Attachments

Activity

People

Dates