Support jit execution of aggregate functions to improve performance of aggregate operator#660
Open
taiyang-li wants to merge 98 commits into
Open
Support jit execution of aggregate functions to improve performance of aggregate operator#660taiyang-li wants to merge 98 commits into
taiyang-li wants to merge 98 commits into
Conversation
added 30 commits
June 23, 2026 16:52
added 26 commits
June 23, 2026 16:53
Derive chunk function names from compact hashed slot descriptions and avoid storing derived function-name members. Simplify JIT planning debug output to reuse slot and chunk descriptions.
Keep only planned JIT chunks on GroupingSet and allocate transient add/extract runtimes locally so per-call state does not persist on the operator.
Submit each chunk's codegen to the global CPU executor instead of compiling synchronously on the first batch. Chunks start not-ready (std::atomic<bool> ready_) and the query thread falls back to the non-JIT path until compilation completes, then switches to JIT. Also removes benchmark warmup so first-batch compile latency is measured.
|
liyang.127 seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account. You have signed the CLA already but the status is still pending? Let us recheck it. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What problem does this PR solve?
Issue Number: close #xxx
Type of Change
Description
Hash Aggregation JIT is a compilation optimization technique for hash aggregation operators in the Bolt engine. The core idea is to generate a JIT kernel for "similar numerical aggregation functions" in HashAgg (such as multiple sum(xi)), merging multiple virtual function calls with scattered small loops into a compact, vectorizable update loop, thereby significantly reducing CPU overhead.
Performance Impact
benchmark report
Release Note
Please describe the changes in this PR
Release Note:
Checklist (For Author)
Breaking Changes
No
Yes (Description: ...)
Click to view Breaking Changes