Skip to content

Add support for models using final_logits_softcapping to AsyncGRPOTrainer #5692

@mlarnouhet

Description

@mlarnouhet

Feature request

Add support for models using final_logits_softcapping to AsyncGRPOTrainer

Motivation

Some models (like gemma 2) use final_logits_softcapping and currently cannot be run with AsyncGRPOTrainer.

Your contribution

I would be happy to collaborate on this feature request by creating a potential PR.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions