-
Notifications
You must be signed in to change notification settings - Fork 138
Pull requests: vllm-project/vllm-gaudi
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update Dockerfiles and documentation for v0.19.1.post1 release
documentation
Improvements or additions to documentation
skip-gaudi-tests
#1544
opened Jun 12, 2026 by
PatrykWo
Collaborator
Loading…
Reapply path for block_size for non-GDN hybrid models
#1543
opened Jun 11, 2026 by
rsmyrek
Contributor
Loading…
[FIX_FOR_VLLM_CUSTOM=d841386d272200dd381a5791833771db9a47adf7] Adapt HPU MoE to the FusedMoE/MoERunner inversion refactor
#1536
opened Jun 9, 2026 by
pawel-olejniczak
Collaborator
Loading…
Reapply path for block_size for non-GDN hybrid models
#1535
opened Jun 9, 2026 by
rsmyrek
Contributor
Loading…
[WIP][GDN] Integrate gdn_read_decayed_state TPC kernel into decode path
#1531
opened Jun 8, 2026 by
osavchenkox
Contributor
Loading…
4 of 6 tasks
Reapply path for block_size for non-GDN hybrid models
#1523
opened Jun 3, 2026 by
ksmusz
Contributor
Loading…
dynamic_quant: 20% improvements in per-channel mode
#1512
opened Jun 1, 2026 by
osavchenkox
Contributor
Loading…
fix: raise default max_cudagraph_capture_size floor to 16384
#1502
opened May 27, 2026 by
kamil-kaczor
Contributor
Loading…
1 of 3 tasks
fix: prevent eager-mode env vars leaking to lazy-mode subprocesses
#1501
opened May 27, 2026 by
kamil-kaczor
Contributor
Loading…
3 of 4 tasks
[DO_NOT_MERGE] Skip gather/index_copy for sequential state indices in GDN decode
#1487
opened May 24, 2026 by
osavchenkox
Contributor
Loading…
[DO_NOT_MERGE] Use HPU causal_conv1d_update TPC kernel instead of multiple ops
#1484
opened May 23, 2026 by
osavchenkox
Contributor
Loading…
[DO_NOT_MERGE] Remove repeat_interleave from GDN recurrent path
#1483
opened May 22, 2026 by
osavchenkox
Contributor
Loading…
[DO_NOT_MERGE] Use l2Norm TPC kernel for HPU compile
#1457
opened May 19, 2026 by
osavchenkox
Contributor
Loading…
Global patch for t.compile high warmup time on MoE models
#1452
opened May 17, 2026 by
tvoas
Contributor
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.