Checklist
Describe the Bug
When I run my training script after installing flash-linear-attention[cuda] , I get a complicated crash involving nvcc and incompatible headers. There is no nvcc on my system. I captured the full output of the script, including the crash, in the attached file.
error.txt
Thanks to Claude Code, I found a workaround. I set FLA_DISABLE_BACKEND_DISPATCH=1 and the system works.
If this cannot be easily fixed, could this workaround be added to the readme or FAQ?
Steps to Reproduce the Bug
Use standard Trainer to train a Qwen 3.5 model after installing flash-linear-attention[cuda] and causal-conv1d . The crash apparently happens at the first forward pass.
Expected Behavior
training goes on
Environment Information
- Torch: 2.11.0
- Triton: 3.6.0
- CUDA 13.0 ; container image based on the NVidia CUDA 13.0 runtime image with Ubuntu 24.04
Checklist
Describe the Bug
When I run my training script after installing flash-linear-attention[cuda] , I get a complicated crash involving nvcc and incompatible headers. There is no nvcc on my system. I captured the full output of the script, including the crash, in the attached file.
error.txt
Thanks to Claude Code, I found a workaround. I set FLA_DISABLE_BACKEND_DISPATCH=1 and the system works.
If this cannot be easily fixed, could this workaround be added to the readme or FAQ?
Steps to Reproduce the Bug
Use standard Trainer to train a Qwen 3.5 model after installing flash-linear-attention[cuda] and causal-conv1d . The crash apparently happens at the first forward pass.
Expected Behavior
training goes on
Environment Information