Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Unresolved
Priority: Major - P3
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Query Execution
Sprint:
QE 2025-06-09, QE 2025-06-23, QE 2025-07-07, QE 2025-07-21
Linked BF Score:
200
Confidence Status:
None
Work Order:
3
Size Category:
TBD
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

After switching to the toolchain v5 as default in 341832a we've observed few performance regressions including a 10% regression in ElemMatchLargeMixedInAndOrWithDuplicates causing BF-37502.

The benchmark in question executes a simple find query with nested $elemMatch, $or, and $in predicates. Both $or and $in predicates have many arguments. The benchmark targets SBE engine (not yet enabled by default) which uses absl::raw_hash_set for $in predicate implementation.

Our initial investigation identified >10% regressions in absl::raw_hash_set::find() (~~SERVER-104956~~) and in sbe::vm::ByteCode::runInternal() (this ticket). After disassembling sbe::vm::ByteCode::runInternal() we saw 25% more instructions in v4 binary which might indicate at more aggressive code in-lining. On the other hand, v4 contains 172 more branching instructions pointing at some other possible optimization. Current hypothesis - we are dealing with different switch statement optimizations.

The goal of this ticket is to identify relevant compile-time optimization in v4 vs. v5 toolchain for this particular workload and suggest or force it on sbe::vm::ByteCode::runInternal().

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

bf-37502.js
2 kB
May 08 2025 09:03:07 PM UTC
branching-address-comparison.png
281 kB
May 09 2025 08:12:20 AM UTC
diff_flamegraph2.svg
106 kB
May 08 2025 09:02:58 PM UTC
jmp.patch
2 kB
Jul 10 2025 09:27:56 AM UTC
jmpTrue.asm
2 kB
Jul 10 2025 09:27:56 AM UTC
raw_hash_set_find_diff.svg
488 kB
May 08 2025 09:03:03 PM UTC
run-internal-v4-asm.txt
254 kB
May 08 2025 09:02:29 PM UTC
run-internal-v4-asm-annotated.txt
314 kB
May 08 2025 09:02:29 PM UTC
run-internal-v5-asm.txt
202 kB
May 08 2025 09:02:29 PM UTC
run-internal-v5-asm-annotated.txt
253 kB
May 08 2025 09:02:29 PM UTC

is depended on by

SERVER-104990 Query Execution work to resolve toolchain upgrade performance changes

Closed

is related to

SERVER-104956 Improve absl::raw_hash_set::find() performance on v5 toolchain

Closed

related to

SERVER-107242 Improve performance of ByteCode::getField()

Closed

Assignee:: Unassigned
Reporter:: Romans Kasperovics
Participants:: Romans Kasperovics
Votes:: 0 Vote for this issue
Watchers:: 8 Start watching this issue

Created:: May 08 2025 08:48:16 PM UTC
Updated:: Jul 21 2025 02:57:56 PM UTC
Confidence Status Last Update:: 30/Jun/25 8:05 AM

Details

Description

Attachments

Attachments

Issue Links

Forms

Activity

People

Dates