Loading...

XML

Word

Printable

JSON

Type: Improvement
Resolution: Fixed
Priority: Major - P3
Fix Version/s: 8.3.0-rc0
Affects Version/s: None
Component/s: None
Labels:
None

Assigned Teams:

Query Execution
Backwards Compatibility:
Fully Compatible
Sprint:
QE 2025-06-09, QE 2025-06-23, QE 2025-07-07, QE 2025-07-21
Linked BF Score:
200
CAR Domain/s:
None

Aha! Reference:
None
Tracking Level:
None
Risk Status:
None
Exec Notes:
None
Goal Name(s):
None
Goal Link:
None

After switching to the toolchain v5 as default in 341832a we've observed few performance regressions including a 10% regression in ElemMatchLargeMixedInAndOrWithDuplicates causing BF-37502.

The benchmark in question executes a simple find query with nested $elemMatch, $or, and $in predicates. Both $or and $in predicates have many arguments. The benchmark targets SBE engine (not yet enabled by default) which uses absl::raw_hash_set for $in predicate implementation.

Our initial investigation identified >10% regressions in absl::raw_hash_set::find() (~~SERVER-104956~~) and in sbe::vm::ByteCode::runInternal() (this ticket). After disassembling sbe::vm::ByteCode::runInternal() we saw 25% more instructions in v4 binary which might indicate at more aggressive code in-lining. On the other hand, v4 contains 172 more branching instructions pointing at some other possible optimization. Current hypothesis - we are dealing with different switch statement optimizations.

The goal of this ticket is to identify relevant compile-time optimization in v4 vs. v5 toolchain for this particular workload and suggest or force it on sbe::vm::ByteCode::runInternal().

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

bf-37502.js
May 08 2025 09:03:07 PM UTC
2 kB
Romans Kasperovics
branching-address-comparison.png
May 09 2025 08:12:20 AM UTC
281 kB
Romans Kasperovics
diff_flamegraph2.svg
May 08 2025 09:02:58 PM UTC
106 kB
Romans Kasperovics
jmp.patch
Jul 10 2025 09:27:56 AM UTC
2 kB
Mindaugas Malinauskas
jmpTrue.asm
Jul 10 2025 09:27:56 AM UTC
2 kB
Mindaugas Malinauskas
raw_hash_set_find_diff.svg
May 08 2025 09:03:03 PM UTC
488 kB
Romans Kasperovics
run-internal-v4-asm.txt
May 08 2025 09:02:29 PM UTC
254 kB
Romans Kasperovics
run-internal-v4-asm-annotated.txt
May 08 2025 09:02:29 PM UTC
314 kB
Romans Kasperovics
run-internal-v5-asm.txt
May 08 2025 09:02:29 PM UTC
202 kB
Romans Kasperovics
run-internal-v5-asm-annotated.txt
May 08 2025 09:02:29 PM UTC
253 kB
Romans Kasperovics

is depended on by

SERVER-104990 Query Execution work to resolve toolchain upgrade performance changes

Closed

is related to

SERVER-104956 Improve absl::raw_hash_set::find() performance on v5 toolchain

Closed

related to

SERVER-107242 Improve performance of ByteCode::getField()

Closed

Assignee:: Ivan Fefer
Reporter:: Romans Kasperovics
Participants:: Githook User, Ivan Fefer, Romans Kasperovics
Votes:: 0 Vote for this issue
Watchers:: 9 Start watching this issue

Created:: May 08 2025 08:48:16 PM UTC
Updated:: Mar 17 2026 12:55:17 PM UTC
Resolved:: Mar 17 2026 12:40:13 PM UTC

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

PagerDuty