Benchmarks: Add Mixture of Experts Model by dpower4 · Pull Request #679 · microsoft/superbenchmark

dpower4 · 2024-12-19T17:11:45Z

Added MoE model using MixtralConfig.

Added 8x7b and 8x22b variants
Requires high VRAM as all experts are loaded in memory. Thus, disabled training due to memory constraint on test worker.

codecov · 2024-12-19T17:55:44Z

Codecov Report

Attention: Patch coverage is 88.23529% with 16 lines in your changes missing coverage. Please review.

Project coverage is 86.47%. Comparing base (deef9a3) to head (434e442).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...enchmarks/model_benchmarks/pytorch_mixtral_impl.py	86.20%	16 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #679      +/-   ##
==========================================
+ Coverage   86.44%   86.47%   +0.03%     
==========================================
  Files         100      102       +2     
  Lines        7406     7541     +135     
==========================================
+ Hits         6402     6521     +119     
- Misses       1004     1020      +16

Flag	Coverage Δ
cpu-python3.10-unit-test	`71.59% <38.80%> (-0.61%)`	⬇️
cpu-python3.12-unit-test	`71.59% <38.80%> (-0.61%)`	⬇️
cpu-python3.7-unit-test	`70.65% <9.55%> (-1.15%)`	⬇️
cuda-unit-test	`83.98% <85.82%> (+0.03%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

…fp8)

Copilot

Pull Request Overview

This PR introduces a new Mixture of Experts (MoE) model variant using MixtralConfig with two parameter sets (8x7b and 8x22b), along with associated tests and documentation updates. It includes adding version checks for Python, conditional imports, benchmark registrations, and exporting support for the new model.

Reviewed Changes

Copilot reviewed 6 out of 7 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/helper/decorator.py	Added a Python version check decorator for tests.
tests/benchmarks/model_benchmarks/test_pytorch_mixtral.py	Introduced tests for the new Mixtral MoE benchmark (8x7b variant).
superbench/benchmarks/model_benchmarks/pytorch_mixtral.py	Implemented the Mixtral benchmark model and registered two variants.
superbench/benchmarks/model_benchmarks/init.py	Updated module imports and all to conditionally include MoE model.
superbench/benchmarks/micro_benchmarks/_export_torch_to_onnx.py	Extended ONNX export support to include Mixtral models.
docs/user-tutorial/benchmarks/model-benchmarks.md	Added documentation for MoE models.

Files not reviewed (1)

docs/superbench-config.mdx: Language not supported

Comments suppressed due to low confidence (1)

superbench/benchmarks/micro_benchmarks/_export_torch_to_onnx.py:28

[nitpick] Consider renaming this class to Torch2ONNXExporter to adhere to standard Python CamelCase naming conventions.

class torch2onnxExporter():

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Description Add release note for v0.12.0 # Main Features ## SuperBench Improvement 1. - [x] Update Image Build Pipeline (#659) 2. - [x] Add support for arm64 build (#660) 3. - [x] Upgrade dependency versions in pipeline (#671) 4. - [x] Fix installation and lint issues (#684) 5. - [x] Update Flake8 repo (#683) 6. - [x] Init latest python support. (#687) 7. - [x] Add image build on arm64 arch (#690) 8. - [x] Enhancement of ignoring errors for import pkg_resources (#692) 9. - [x] Update label in the ROCm image build (#693) 10. - [x] Support cuda12.8 for Blackwell arch (#682) 11. - [x] Merge multi-arch image (#696) 12. - [x] Update OS of runner to the latest. (#702) 13. - [x] cuda arch flag for cublaslt (#701) ## Micro-benchmark Improvement 1. - [x] Bug Fix - Fix numa error on grace cpu in gpu-copy (#658) 2. - [x] Dependency - Bump onnxruntime-gpu version from 1.10.0 to 1.12.0 (#663) 3. - [x] Benchmarks: micro benchmarks - add general CPU bandwidth and latency benchmark (#662) 4. - [x] Benchmarks: micro benchmarks - add nvbandwidth build and benchmark (#665 and #669) 5. - [x] Fix stderr message in gpu-copy benchmark (#673) 6. - [x] Add arch support for 10.0 in gemm-flops (#680) 7. - [x] Fix tensorrt-inference parsing (#674) 8. - [x] nvbandwidth benchmark need to handle N/A value (#675) 9. - [x] Avoid Unintended nvbandwidth Function Calls in All Benchmarks (#685) 10. - [x] Add GPU Stream Micro Benchmark (#697) 11. - [x] Cuda arch flag for cublaslt (#701) 12. - [x] Support autotuning in cublaslt gemm (#706) 14. - [x] Add FP4 GEMM FLOPS support for cublaslt_gemm benchmark (#711) 15. - [x] CPU Stream Benchmark Revise (#712) 16. - [x] Add cuda12.9 docker image (#716) 17. - [x] Add Grace CPU support for CPU Stream (#719) ## Model Benchmark Improvement 1. - [x] Add LLaMA-2 Models (#668) 2. - [x] Fix typos in documentation and code files (#686) 3. - [x] Add Mixture of Experts Model (#679) 4. - [ ] Add DeepSeek Training Benchmark 5. - [x] Add DeepSeek Inference Benchmark (AMD GPU) (#713) ## Documentation 1. - [x] Update CODEOWNERS (#670) 2. - [x] Update CODEOWNERS (#718) ## Result Analysis 1. - [x] Enhance logging information for diagnosis rule op baseline errors. (#689)

dpower4 added 4 commits December 18, 2024 09:30

add mixtral

54aed28

fp16 unittest

deeedde

update docs

5f5a75d

remove train unit test due to memroy constraints

025256e

dpower4 requested review from a team, cp5555 and guoshzhao as code owners December 19, 2024 17:11

dpower4 added 2 commits December 19, 2024 09:31

lint fix

0273915

lint fix

58c5de9

dpower4 added 13 commits December 19, 2024 11:31

disable py3.7 tests for mixtral

0e4e9c6

enable py3.7 checks for mixtral

64abec0

fix lint

793713c

fix lint

60df7ed

fix mixtal model benchmark for 3.7

8782404

fix lint

c5b87ac

fix lint F401 warning

bf780d0

fix lint error

48e67f8

check py version to >=3.8 for mixtral

fec4df3

cleanup

9272769

cleanup

795a359

reduce mixtral dims to reduce vram req

f36f4f7

mixtral uni test to float16 instead of fp8 (worker 8.9 or higher for …

83c9981

…fp8)

dpower4 added benchmarks SuperBench Benchmarks micro-benchmarks Micro Benchmark Test for SuperBench Benchmarks model-benchmarks Model Benchmark Test for SuperBench Benchmarks labels Dec 31, 2024

dpower4 requested a review from abuccts December 31, 2024 01:46

abuccts requested a review from Copilot April 21, 2025 22:32

Copilot AI reviewed Apr 21, 2025

View reviewed changes

Comment thread superbench/benchmarks/model_benchmarks/pytorch_mixtral.py Outdated

abuccts reviewed Apr 30, 2025

View reviewed changes

Comment thread superbench/benchmarks/model_benchmarks/__init__.py Outdated

Comment thread superbench/benchmarks/micro_benchmarks/_export_torch_to_onnx.py Outdated

Comment thread tests/helper/decorator.py Outdated

guoshzhao mentioned this pull request May 14, 2025

V0.12.0 Release Plan #710

Closed

40 tasks

polarG and others added 8 commits June 26, 2025 10:26

Update superbench/benchmarks/model_benchmarks/pytorch_mixtral.py

83ebcbe

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Remove redundant code.

ecbdc46

Merge branch 'main' into feat/mixtral

38e88a4

Fix lint issues.

89beafd

Merge branch 'main' into feat/mixtral

3ae4600

Move the version check to pytorch_mixtral.py

ace2f1b

Fix lint issues.

659bf3b

Fix py3.7 unittest.

434e442

polarG approved these changes Jun 28, 2025

View reviewed changes

polarG enabled auto-merge (squash) June 28, 2025 05:13

guoshzhao approved these changes Jun 29, 2025

View reviewed changes

polarG merged commit 44e35cd into microsoft:main Jun 30, 2025
21 of 22 checks passed

polarG mentioned this pull request Aug 6, 2025

Docs - Upgrade version and release note #727

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmarks: Add Mixture of Experts Model #679

Benchmarks: Add Mixture of Experts Model #679
polarG merged 27 commits into
microsoft:mainfrom
dpower4:feat/mixtral

dpower4 commented Dec 19, 2024

Uh oh!

codecov Bot commented Dec 19, 2024 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

dpower4 commented Dec 19, 2024

Uh oh!

codecov Bot commented Dec 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

codecov Bot commented Dec 19, 2024 •

edited

Loading