[Bug] nvcc related crash on first forward pass - resolved with FLA_DISABLE_BACKEND_DISPATCH=1 #949

Description

@mramendi

Checklist

  • I have checked FAQs and existing issues for similar problems
  • Please report this bug in English to ensure wider understanding and support

Describe the Bug

When I run my training script after installing flash-linear-attention[cuda], it crashes with an error involving nvcc and incompatible headers, even though there is no nvcc on my system. I captured the full output of the script, including the crash, in the attached file.

error.txt

Thanks to Claude Code, I found a workaround: with FLA_DISABLE_BACKEND_DISPATCH=1 set, training runs without the crash.

If this cannot be easily fixed, could this workaround be added to the readme or FAQ?
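For anyone who wants to apply the workaround from inside the training script rather than from the shell, here is a minimal sketch. It only sets the environment variable named above. Two things are assumptions on my part and not confirmed by this report: that the package is imported as `fla`, and that the variable has to be set before that import to take effect (setting it first is harmless either way). The helper name `disable_fla_backend_dispatch` is hypothetical.

```python
import os
import sys


def disable_fla_backend_dispatch() -> bool:
    """Set FLA_DISABLE_BACKEND_DISPATCH=1 for the current process.

    Returns True if the `fla` package had not been imported yet when the
    variable was set, False otherwise.
    """
    os.environ["FLA_DISABLE_BACKEND_DISPATCH"] = "1"
    # Assumption: the library may read the variable at import time, so
    # report whether `fla` was already loaded before we set it.
    return "fla" not in sys.modules


set_early = disable_fla_backend_dispatch()
if not set_early:
    print(
        "warning: fla was imported before FLA_DISABLE_BACKEND_DISPATCH was set",
        file=sys.stderr,
    )

# Import torch / transformers / the model code only after this point.
```

Placing this at the very top of the training entry point, before any other imports, keeps the setting independent of how the job is launched.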

Steps to Reproduce the Bug

Train a Qwen 3.5 model with the standard Trainer after installing flash-linear-attention[cuda] and causal-conv1d. The crash appears to happen on the first forward pass.

Expected Behavior

Training proceeds past the first forward pass without crashing.

Environment Information

  1. Torch: 2.11.0
  2. Triton: 3.6.0
  3. CUDA: 13.0; container image based on the NVIDIA CUDA 13.0 runtime image with Ubuntu 24.04
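To make this kind of report easier to reproduce, the details above can be collected with a short script. This is a sketch, not part of the library: the helper name `collect_env_report` is hypothetical, and the package names are the pip distribution names used in this report. It also records whether nvcc is on PATH, which matters here because CUDA runtime images generally do not ship the compiler.

```python
import os
import shutil
from importlib import metadata


def collect_env_report(
    packages=("torch", "triton", "flash-linear-attention", "causal-conv1d"),
):
    """Return installed versions of the given packages plus nvcc/CUDA_HOME info."""
    report = {}
    for name in packages:
        try:
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Package is not installed in this environment.
            report[name] = None
    # Full path to nvcc if it is on PATH, otherwise None.
    report["nvcc"] = shutil.which("nvcc")
    # CUDA_HOME is only reported if the variable is set.
    report["CUDA_HOME"] = os.environ.get("CUDA_HOME")
    return report


report = collect_env_report()
for key, value in report.items():
    print(f"{key}: {value if value is not None else 'not found'}")
```

A `not found` entry for nvcc together with an nvcc-related crash would match the situation described in this issue.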

Labels: bug