Issues · NVIDIA/NeMo-Aligner · GitHub

This repository was archived by the owner on Nov 19, 2025. It is now read-only.

[Important] NeMo-Aligner deprecation notice (see NeMo RL)
#542 · terrykong opened on May 16, 2025

Labels Milestones

[Important] NeMo-Aligner deprecation notice (see NeMo RL)

#542

· terrykong opened

on May 16, 2025

PPOTrainer can't be imported when the slurm job has more than one node

#541

· mrm-196 opened

on May 1, 2025

RuntimeError: Error(s) in loading state_dict for GPTModel

#539

· mrm-196 opened

on Apr 17, 2025

Fomula confusion in distill loss funtion

#538

· yspMing opened

on Apr 14, 2025

ValueError: Expected a parent

#535

· starxa2 opened

on Mar 18, 2025

llama-70b SFT OSError: [Errno 5] Input/output error

#502

· songwangnlp opened

on Feb 7, 2025

Memory inefficiency when loading attention_mask, causing dataloader OOM with long context

#488

· shensimeteor opened

on Jan 22, 2025

ImportError: cannot import name 'MoESubmodules' from 'megatron.core.transformer.moe.moe_layer'

#483

· Cppowboy opened

on Jan 16, 2025

Support other RL algorithms(REINFORCE, ReMax, RLOO, REINFORCE++)

#481

· Cppowboy opened

on Jan 11, 2025

The version number is wrong

#476

· null-test-7 opened

on Jan 6, 2025

Out of Memory (OOM) During Training a LLaMA 7B Reward Model (8 A800 40GB GPUs)

#444

· qingyiaaaaa opened

on Dec 11, 2024

use lightning or pytorch-lightning

#438

· better629 opened

on Dec 10, 2024