Skip to content

Pull requests: fla-org/flash-linear-attention

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[GDN] Fix GDN precision on Blackwell
#948 opened Jun 14, 2026 by syeehyn Loading…
[CI] Add ascend-a2-ci workflow for Atlas A2 NPU module tests
#944 opened Jun 12, 2026 by zheliuyu Contributor Loading…
3 of 4 tasks
Bump torch from 2.6.0 to 2.12.0 dependencies Pull requests that update a dependency file python Pull requests that update python code
#943 opened Jun 11, 2026 by dependabot Bot Loading…
[Fix] Fix shared memory race in tilelang chunk_bwd dg_last accumulation help wanted Extra attention is needed
#890 opened May 11, 2026 by Erix025 Loading…
[SSE] Add SSE integration
#882 opened May 9, 2026 by Pan-Yuqi Contributor Loading…
[GDN] Tricked kernels: ungated KKT + fused inference via similarity transform
#797 opened Mar 28, 2026 by hypnopump Contributor Loading…
5 tasks
[Layernorm] Fix autotuner crash and OOB writes in layer_norm_bwd on high-SM GPUs
#796 opened Mar 28, 2026 by mpurland Contributor Loading…
5 tasks done
Add fused short convolution kernel with L2 norm
#661 opened Nov 24, 2025 by sustcsonglin Collaborator Loading…
[kda] add recursive block intra implementation
#656 opened Nov 22, 2025 by sustcsonglin Collaborator Loading…
Update README.md of ops delta_rule
#595 opened Sep 17, 2025 by SeepingFragranceLock Contributor Loading…
Cached inference for NSA
#574 opened Aug 22, 2025 by mutiann Contributor Loading…
ProTip! Adding no:label will show everything without a label.