Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers
-
Updated
Feb 5, 2026 - Jupyter Notebook
Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers
Coursework for CMU 11-868: Large Language Model Systems.
Add a description, image, and links to the machine-learning-system topic page so that developers can more easily learn about it.
To associate your repository with the machine-learning-system topic, visit your repo's landing page and select "manage topics."