[NPU]: support use-rollout-routing-replay by Windfeng8 · Pull Request #292 · vllm-project/vime

Windfeng8 · 2026-06-23T12:46:11Z

No description provided.

read-the-docs-community · 2026-06-23T12:47:15Z

Documentation build overview

📚 vime | 🛠️ Build #33270785 | 📁 Comparing 5c4969a against latest (e62d44f)

🔍 Preview build

8 files changed · + 1 added · ± 6 modified · - 1 deleted

+ Added

get_started/NPU.html

± Modified

- Deleted

_examples_synced/tau-bench/README.html

gemini-code-assist

Code Review

This pull request introduces a new training script run-qwen3-30B-A3B-npu-use-routing-replay.sh for running Qwen3-30B-A3B training on NPUs. Feedback on the script highlights two main issues: first, there are duplicate arguments (--use-dynamic-batch-size and --max-tokens-per-gpu) with conflicting values (8192 vs 20480) that need to be cleaned up; second, a hardcoded user directory path is used for the prompt dataset, which should be replaced with a configurable environment variable to improve portability.

gemini-code-assist · 2026-06-23T12:49:29Z

+  --use-dynamic-batch-size \
+  --max-tokens-per-gpu 8192 \


The arguments --use-dynamic-batch-size and --max-tokens-per-gpu are duplicated in this script (see lines 90-91). Additionally, --max-tokens-per-gpu is specified with conflicting values (8192 vs 20480). Please remove the duplicate entries and keep only the correct configuration (likely 20480 to match the max model length of 20k).
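
Duplicates like these can be caught before review by listing every long option that appears more than once in the launch script. The following is only a rough sketch: the helper name `find_duplicate_flags`, the `train.py` entry point, and the throwaway demo file are hypothetical and not part of this PR. It matches `--long-option` tokens textually and does not understand shell quoting or comments.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print each --long-option that occurs more than once in a script.
find_duplicate_flags() {
  local script="$1"
  # -o prints only the matched tokens; the bare "--" stops grep from
  # treating the pattern itself as a command-line option.
  grep -oE -- '--[A-Za-z0-9][A-Za-z0-9-]*' "$script" | sort | uniq -d
}

# Demo on a temporary file that mimics the duplicated arguments flagged above.
# The entry point name is an assumption made for illustration only.
demo="$(mktemp)"
cat > "$demo" <<'EOF'
python3 train.py \
  --use-dynamic-batch-size \
  --max-tokens-per-gpu 8192 \
  --hf-checkpoint /home/data/Qwen3-30B-A3B/ \
  --use-dynamic-batch-size \
  --max-tokens-per-gpu 20480 \
EOF
duplicates="$(find_duplicate_flags "$demo")"
echo "$duplicates"
rm -f "$demo"
```

The helper reports only that a flag repeats; deciding which value to keep (8192 or 20480 here) is still up to the author.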

gemini-code-assist · 2026-06-23T12:49:29Z

+  \
+  --hf-checkpoint /home/data/Qwen3-30B-A3B/ \
+  \
+  --prompt-data /home/w00893744/dataset/dapo-math-17k/dapo-math-17k.jsonl \


The path /home/w00893744/dataset/dapo-math-17k/dapo-math-17k.jsonl contains a hardcoded user directory (w00893744), which prevents the script from being portable across different environments or users. Consider using a configurable environment variable with a fallback default.

Suggested change

-  --prompt-data /home/w00893744/dataset/dapo-math-17k/dapo-math-17k.jsonl \
+  --prompt-data "${PROMPT_DATA:-/home/data/dapo-math-17k/dapo-math-17k.jsonl}" \
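
The `${PROMPT_DATA:-...}` form in the suggestion is standard shell parameter expansion: it yields the variable's value when it is set and non-empty, and the fallback otherwise. Below is a minimal sketch of that behavior. The function name `resolve_prompt_data` and the override path `/tmp/my-dataset.jsonl` are hypothetical and exist only for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper around the expansion used in the suggested change.
resolve_prompt_data() {
  printf '%s\n' "${PROMPT_DATA:-/home/data/dapo-math-17k/dapo-math-17k.jsonl}"
}

# Case 1: variable unset, so the fallback is used.
unset PROMPT_DATA
default_path="$(resolve_prompt_data)"

# Case 2: variable set but empty; ":-" treats empty like unset.
PROMPT_DATA=""
empty_path="$(resolve_prompt_data)"

# Case 3: variable set to a (hypothetical) override path.
PROMPT_DATA="/tmp/my-dataset.jsonl"
override_path="$(resolve_prompt_data)"

echo "$default_path"
echo "$empty_path"
echo "$override_path"
```

With this pattern a user could point the script at their own copy of the dataset by exporting `PROMPT_DATA` before launching it, without editing the script.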

CalvinXKY · 2026-06-25T07:02:15Z

I'd suggest putting the sh script in #270 and using the ray submit format for job submission.

fix: use rollout routing replay

2006e69

gemini-code-assist Bot reviewed Jun 23, 2026

refactor: rename script to run-qwen3-30B-A3B-npu-use-rollout-routing-replay.sh

5c4969a

Windfeng8 mentioned this pull request Jun 24, 2026

[RFC] Steps and Test Result for Running Qwen3-30B on NPU(A3) （use-rollout-routing-replay） #270

Open

Windfeng8 wants to merge 2 commits into vllm-project:ascend from Windfeng8:fix/routing-replay-v2
