Skip to content

Pull requests: swiss-ai/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add quantile balancing MoE router
#135 opened Jun 24, 2026 by andresnowak Draft
Add token mapping script for datasets
#130 opened Jun 22, 2026 by stefantaga24 Loading…
Top-K Logits Distillation
#90 opened Sep 30, 2025 by BlackSamorez Loading…
NGC25.05 + Fix Xielu
#87 opened Jul 31, 2025 by TJ-Solergibert Loading…
Update update upstream
#84 opened Jun 25, 2025 by AleHD Collaborator Loading…
Update upstream
#82 opened Jun 25, 2025 by AleHD Collaborator Loading…
Data Mixture Modification Script
#74 opened May 27, 2025 by alexdremov Loading…
swiss
#73 opened May 23, 2025 by xrsrke Loading…
Adding CSCS' XIELU to Megatron-LM
#71 opened May 6, 2025 by rubber-duck-debug Loading…
Minor sbatch fixes
#68 opened Apr 1, 2025 by henrique Loading…
Log all grad norms
#50 opened Feb 27, 2025 by dhia680 Member Loading…
Process error file - v0
#49 opened Feb 27, 2025 by dhia680 Member Loading…
Update fork
#34 opened Feb 14, 2025 by AleHD Collaborator Loading…
Slack bot
#16 opened Feb 10, 2025 by dhia680 Member Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.