-
Notifications
You must be signed in to change notification settings - Fork 560
Pull requests: fla-org/flash-linear-attention
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model] Add Preconditioned Gated DeltaNet (PGDN) and KDA (PKDA)
#950
opened Jun 17, 2026 by
ntumm120
Loading…
[GDN] Restrict chunk_delta_h Blackwell autotune stages
#946
opened Jun 12, 2026 by
IgorYashch
Loading…
[CI] Add ascend-a2-ci workflow for Atlas A2 NPU module tests
#944
opened Jun 12, 2026 by
zheliuyu
Contributor
Loading…
3 of 4 tasks
Bump torch from 2.6.0 to 2.12.0
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#943
opened Jun 11, 2026 by
dependabot
Bot
Loading…
[Fix] parallel_attn: keep NV==1 in backward so dq/dk reduce over full V
#941
opened Jun 6, 2026 by
kasper0406
Contributor
Loading…
[Attn] Add Parallax (parameterized local linear attention) op, layer and model
#939
opened Jun 5, 2026 by
Yifei-Zuo
Loading…
[Ops] Propagate chunk_size through non-attention chunk kernels and add coverage
#935
opened Jun 4, 2026 by
zhiyuan1i
Collaborator
Loading…
[KDA] Add fused BT=16 inference kernels for KDA prefill
#915
opened May 22, 2026 by
kuoihao
Loading…
[Fix] Zero-init chunk-mode backward gradient buffers to prevent NaN propagation
#892
opened May 12, 2026 by
xylian86
Loading…
[Fix] Fix shared memory race in tilelang chunk_bwd dg_last accumulation
help wanted
Extra attention is needed
#890
opened May 11, 2026 by
Erix025
Loading…
[KDA][AMD]for kda kernel,fix core dump on AMD GPU and tune the config for AMD branch
#869
opened Apr 29, 2026 by
binding7012
Loading…
feat: add Quasar Attention and standalone model implementation
#805
opened Mar 31, 2026 by
troy12x
Loading…
[GDN] Tricked kernels: ungated KKT + fused inference via similarity transform
#797
opened Mar 28, 2026 by
hypnopump
Contributor
Loading…
5 tasks
[Layernorm] Fix autotuner crash and OOB writes in layer_norm_bwd on high-SM GPUs
#796
opened Mar 28, 2026 by
mpurland
Contributor
Loading…
5 tasks done
Add fused short convolution kernel with L2 norm
#661
opened Nov 24, 2025 by
sustcsonglin
Collaborator
Loading…
[kda] add recursive block intra implementation
#656
opened Nov 22, 2025 by
sustcsonglin
Collaborator
Loading…
[Deltaformer] kernel improvement; if-else optimization; change w to fp32; add 1e-9 to avoid nan
#603
opened Sep 30, 2025 by
foreverpiano
Loading…
Update README.md of ops delta_rule
#595
opened Sep 17, 2025 by
SeepingFragranceLock
Contributor
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.