fla-org / flash-linear-attention Public

Pull requests: fla-org/flash-linear-attention

28 Open 570 Closed

Pull requests list

[Model] Add Preconditioned Gated DeltaNet (PGDN) and KDA (PKDA)

#950 opened Jun 17, 2026 by ntumm120

[GDN] Fix GDN precision on Blackwell

#948 opened Jun 14, 2026 by syeehyn

[GDN] Restrict chunk_delta_h Blackwell autotune stages

#946 opened Jun 12, 2026 by IgorYashch

[CI] Add ascend-a2-ci workflow for Atlas A2 NPU module tests

#944 opened Jun 12, 2026 by zheliuyu Contributor

3 of 4 tasks

Bump torch from 2.6.0 to 2.12.0 (labels: dependencies, python)

#943 opened Jun 11, 2026 by dependabot Bot

[Fix] parallel_attn: keep NV==1 in backward so dq/dk reduce over full V

#941 opened Jun 6, 2026 by kasper0406 Contributor

[Attn] Add Parallax (parameterized local linear attention) op, layer and model

#939 opened Jun 5, 2026 by Yifei-Zuo

[Ops] Propagate chunk_size through non-attention chunk kernels and add coverage

#935 opened Jun 4, 2026 by zhiyuan1i Collaborator

[KDA] Add fused BT=16 inference kernels for KDA prefill

#915 opened May 22, 2026 by kuoihao

[Fix] FLA cache window updates and reset behavior

#910 opened May 21, 2026 by Michael-RDev

[Fix] Zero-init chunk-mode backward gradient buffers to prevent NaN propagation

#892 opened May 12, 2026 by xylian86

[Fix] Fix shared memory race in tilelang chunk_bwd dg_last accumulation (label: help wanted)

#890 opened May 11, 2026 by Erix025

[SSE] Add SSE integration

#882 opened May 9, 2026 by Pan-Yuqi Contributor

[KDA][AMD]for kda kernel,fix core dump on AMD GPU and tune the config for AMD branch

#869 opened Apr 29, 2026 by binding7012

[Ops] Fix int32 overflow in pointer arithmetic across all Triton kernels

#818 opened Apr 8, 2026 by tmct Contributor • Draft

Add MALA (Magnitude-Aware Linear Attention) to FLA

#809 opened Apr 3, 2026 by drdanielwuwu

feat: add Quasar Attention and standalone model implementation

#805 opened Mar 31, 2026 by troy12x

[GDN] Tricked kernels: ungated KKT + fused inference via similarity transform

#797 opened Mar 28, 2026 by hypnopump Contributor

5 tasks

[Layernorm] Fix autotuner crash and OOB writes in layer_norm_bwd on high-SM GPUs

#796 opened Mar 28, 2026 by mpurland Contributor

5 tasks done

[NPU] add NPU (Ascend) backend for chunk_gla

#737 opened Feb 5, 2026 by noemotiovon • Draft

Add fused short convolution kernel with L2 norm

#661 opened Nov 24, 2025 by sustcsonglin Collaborator

[kda] add recursive block intra implementation

#656 opened Nov 22, 2025 by sustcsonglin Collaborator

[Deltaformer] kernel improvement; if-else optimization; change w to fp32; add 1e-9 to avoid nan

#603 opened Sep 30, 2025 by foreverpiano

Update README.md of ops delta_rule

#595 opened Sep 17, 2025 by SeepingFragranceLock Contributor

Cached inference for NSA

#574 opened Aug 22, 2025 by mutiann Contributor
