[HIP] Add offload PGO tiled matmul E2E test by yxsamliu · Pull Request #366 · llvm/llvm-test-suite

yxsamliu · 2026-03-10T13:58:42Z

[HIP] Add offload PGO tiled matmul E2E test

Add a tiled matrix multiply kernel that demonstrates the offload PGO
workflow on AMDGPU. The kernel uses a large per-thread sub-tile
(configurable via -DTH_M and -DTH_N) with LDS-based cooperative tile
loading, creating natural register pressure that exceeds the VGPR
budget and causes spills. Boundary tile handling creates biased
branches that PGO can optimize by guiding the register allocator to
reduce spills on the hot path.

Sub-tile sizes are tunable per architecture to induce spills on GPUs
with different register file sizes.

Two tests are registered:

pgo-tiled-matmul: correctness test (compile + run + verify)
pgo-tiled-matmul-pipeline: full PGO pipeline test
(baseline -> instrument -> collect -> merge -> PGO build -> compare)

The pipeline test verifies that the full -fprofile-generate /
-fprofile-use workflow completes successfully and reports the
performance difference for information.

jmmartinez

Looks good to me but I'll JP have the final word.

jplehr

LG
Just comments about the GPU arch of the bots.

Reporting the actual PGO speed-up just for information purpose is good.

Add a tiled matrix multiply kernel that demonstrates the offload PGO workflow on AMDGPU. The kernel uses a large per-thread sub-tile (configurable via -DTH_M and -DTH_N) with LDS-based cooperative tile loading, creating natural register pressure that exceeds the VGPR budget and causes spills. Boundary tile handling creates biased branches that PGO can optimize by guiding the register allocator to reduce spills on the hot path. Sub-tile sizes are tunable per architecture to induce spills on GPUs with different register file sizes. Two tests are registered: - pgo-tiled-matmul: correctness test (compile + run + verify) - pgo-tiled-matmul-pipeline: full PGO pipeline test (baseline -> instrument -> collect -> merge -> PGO build -> compare) The pipeline test verifies that the full -fprofile-generate / -fprofile-use workflow completes successfully and reports the performance difference for information.

yxsamliu · 2026-06-15T13:10:18Z

Gentle ping. The llvm-zorg PR needed by this test has landed now: llvm/llvm-zorg#868

All review comments here have been addressed and resolved. Could you take another look when you get a chance?

yxsamliu requested review from jmmartinez and jplehr March 10, 2026 14:03

jmmartinez reviewed Mar 10, 2026

View reviewed changes

Comment thread External/HIP/pgo-tiled-matmul.hip Outdated

Comment thread External/HIP/workload/pgo/test_pgo_matmul.sh.in Outdated

jmmartinez reviewed Mar 10, 2026

View reviewed changes

jplehr reviewed Mar 11, 2026

View reviewed changes

Comment thread External/HIP/pgo-tiled-matmul.hip Outdated

Comment thread External/HIP/workload/pgo/test_pgo_matmul.sh.in Outdated

yxsamliu force-pushed the amd/dev/yaxunl/pgo-tiled-matmul-test branch from 7e4c1ad to 1edfa31 Compare June 10, 2026 02:26

yxsamliu force-pushed the amd/dev/yaxunl/pgo-tiled-matmul-test branch from 1edfa31 to f6050ee Compare June 10, 2026 13:34

yxsamliu mentioned this pull request Jun 10, 2026

Build AMDGPU profile runtime in HIP buildbot llvm/llvm-zorg#868

Merged

jmmartinez approved these changes Jun 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[HIP] Add offload PGO tiled matmul E2E test#366

[HIP] Add offload PGO tiled matmul E2E test#366
yxsamliu wants to merge 1 commit into
llvm:mainfrom
yxsamliu:amd/dev/yaxunl/pgo-tiled-matmul-test

yxsamliu commented Mar 10, 2026

Uh oh!

Uh oh!

Uh oh!

jmmartinez left a comment

Uh oh!

jplehr left a comment

Uh oh!

Uh oh!

Uh oh!

yxsamliu commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

yxsamliu commented Mar 10, 2026

Uh oh!

Uh oh!

Uh oh!

jmmartinez left a comment

Choose a reason for hiding this comment

Uh oh!

jplehr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

yxsamliu commented Jun 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants