Skip to content

Commit a0ae7bc

Browse files
committed
Fix default values in Key arguments table
1 parent 0c1e43f commit a0ae7bc

1 file changed

Lines changed: 3 additions & 3 deletions

File tree

training/deepspeed_finetune_demo/README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -20,14 +20,14 @@ For example, if we want to run Qwen2.5-3B model with ZeRO offload on 2 GPUs, we
2020
| Argument | Description | Default |
2121
|----------|-------------|---------|
2222
| `--batch_size` | Training batch size per GPU | required |
23-
| `--eval_batch_size` | Eval batch size per rank | 1 |
23+
| `--eval_batch_size` | Eval batch size per rank | 4 |
2424
| `--eval_steps` | Run evaluation every N steps (0 disables) | 0 |
2525
| `--max_steps` | Stop after N steps (-1 = full epoch) | -1 |
2626
| `--checkpoint_steps` | Save a checkpoint every N steps (0 disables); keeps last 2 | 0 |
2727
| `--wandb_name` | Wandb run name (optional) | None |
28-
| `--num_train_epochs` | Number of training epochs | 1 |
28+
| `--num_train_epochs` | Number of training epochs | 3 |
2929
| `--weight_decay` | Weight decay | 0.01 |
30-
| `--warmup` | Warmup steps | 0 |
30+
| `--warmup` | Warmup ratio | 0.01 |
3131

3232
Note: Learning rate is controlled entirely by the DeepSpeed config JSON, not by command-line arguments.
3333

0 commit comments

Comments
 (0)