Tests - Add basic utilities for module-level numerical tests by yzygitzh · Pull Request #75 · microsoft/ltp-megatron-lm

yzygitzh · 2025-07-26T07:59:24Z

Adds basic utilities for module-level numerical tests in the tests/numerical_tests folder, including:

modules/conftest.py: common arguments and fixtures for test classes.
modules/test_module.py: base class that runs module-level numerical tests and dumps the results.
modules/test_utilities.py: Utils class revised from the unit tests so that it respects users' NVTE environment variables.
utils/module_mean_and_std.py: calculates the mean and std over several module test trials.
utils/module_similarity.py: calculates the cosine similarity of two module test stats. The stats can be raw values, means, or stds.
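For readers unfamiliar with the metric, the cosine similarity mentioned above can be sketched as follows. This is a minimal illustration on plain Python lists, not the code in utils/module_similarity.py; the function name `cosine_similarity` and the `eps` guard are assumptions made for the example, and the PR itself presumably operates on saved tensors.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float], eps: float = 1e-12) -> float:
    """Return dot(a, b) / (|a| * |b|) for two equal-length vectors.

    Hypothetical helper for illustration only; `eps` is an assumed guard
    that avoids division by zero when either vector is all zeros.
    """
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / max(norm_a * norm_b, eps)
```

A value near 1 means the two stats point in the same direction, a value near 0 means they are orthogonal, and a value near -1 means they point in opposite directions.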

Copilot

Pull Request Overview

This PR adds basic utilities for module-level numerical tests to enable consistent testing and comparison of module behavior across different configurations. The utilities provide infrastructure for running numerical tests, capturing module statistics, and comparing results between different test runs.

Key changes include:

Test infrastructure with base classes and configuration utilities
Statistical analysis tools for computing means, standard deviations, and similarity metrics
Environment variable management to preserve user NVTE settings during testing

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| `tests/numerical_tests/modules/conftest.py` | Pytest configuration with result directory option and cleanup fixtures |
| `tests/numerical_tests/modules/test_module.py` | Base test class for module-level numerical tests with distributed setup and result saving |
| `tests/numerical_tests/modules/test_utilities.py` | Enhanced Utils class that preserves NVTE environment variables during testing |
| `tests/numerical_tests/utils/module_mean_and_std.py` | Statistical computation utility using Welford's algorithm for streaming mean/std calculation |
| `tests/numerical_tests/utils/module_similarity.py` | Cosine similarity calculation tool for comparing tensor statistics between test runs |
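
The summary above mentions Welford's algorithm for streaming mean/std calculation. A minimal sketch of that technique on scalar values is shown here; the class name `RunningStats` and the `ddof` parameter are assumptions for illustration, and whether the PR reports a population or a sample standard deviation is not stated in this thread.

```python
import math


class RunningStats:
    """Welford's online algorithm for mean and variance (illustrative sketch)."""

    def __init__(self) -> None:
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x: float) -> None:
        # One pass per sample; earlier samples do not need to be kept.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)

    def std(self, ddof: int = 0) -> float:
        # ddof=0 gives the population std, ddof=1 the sample std.
        if self.count - ddof <= 0:
            return float("nan")
        return math.sqrt(self.m2 / (self.count - ddof))


stats = RunningStats()
for value in [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]:
    stats.update(value)
```

The appeal of this formulation is that each trial can be folded in as it is loaded, so memory use does not grow with the number of trials.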

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

cp5555 · 2025-07-28T05:55:41Z

+    with open(args.output_file, 'w') as f:
+        json.dump(comparison_result, f, indent=2)
+
+if __name__ == '__main__':


When do you use it?

cp5555 · 2025-07-28T05:56:17Z

+    torch.save(mean_result, args.output_mean_file)
+    torch.save(std_result, args.output_std_file)
+
+if __name__ == '__main__':


Why do we need main?

cp5555 · 2025-07-28T06:02:42Z

+
+
+def pytest_addoption(parser):
+    parser.addoption(


Why do we need to add a required option to run the tests?
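
One way to address this concern, sketched under assumptions, is to give the option a default so that the tests also run without it. The option name `--result-dir` and the helper `resolve_result_dir` are hypothetical; only `parser.addoption` is the standard pytest hook API.

```python
from pathlib import Path
from typing import Optional


def pytest_addoption(parser):
    # "--result-dir" is an assumed name; default=None keeps the option optional.
    parser.addoption(
        "--result-dir",
        action="store",
        default=None,
        help="Directory for dumped numerical results (optional).",
    )


def resolve_result_dir(option_value: Optional[str], fallback: Path) -> Path:
    """Use the user-supplied directory if given, else a fallback such as tmp_path."""
    return Path(option_value) if option_value else fallback
```

A fixture could then call `request.config.getoption("--result-dir")` and pass the value to `resolve_result_dir` together with pytest's `tmp_path`.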

cp5555 · 2025-07-28T06:06:43Z

+            bf16=config.bf16,
+            use_distributed_optimizer=True,
+            lr=1e-3,
+            clip_grad=0.0


Why are Adam, lr, etc. hard-coded?

abuccts · 2025-07-28T23:50:48Z

+        seed = 42
+        torch.manual_seed(seed)


Do you need to call torch.cuda.manual_seed_all as well, and seed Python's random and NumPy too?
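
A minimal sketch of what seeding every generator could look like. The helper name `seed_everything` is hypothetical and not part of the PR; NumPy and PyTorch are imported lazily so the function still works where they are not installed. Recent PyTorch versions document `torch.manual_seed` as seeding all devices, so the explicit CUDA call is mostly a defensive step.

```python
import random


def seed_everything(seed: int) -> None:
    """Seed Python, NumPy, and PyTorch RNGs (illustrative helper)."""
    random.seed(seed)
    try:
        import numpy as np
        np.random.seed(seed)
    except ImportError:
        pass  # NumPy not installed; nothing to seed
    try:
        import torch
        torch.manual_seed(seed)
        # Safe to call without CUDA; the call is ignored in that case.
        torch.cuda.manual_seed_all(seed)
    except ImportError:
        pass  # PyTorch not installed; nothing to seed
```

Megatron's `model_parallel_cuda_manual_seed(seed)`, which appears in the diff, would still be called separately, since it sets up the model-parallel RNG state.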

abuccts · 2025-07-28T23:51:21Z

+        model_parallel_cuda_manual_seed(seed)
+
+    def teardown_method(self, method):
+        Utils.destroy_model_parallel()


Destroy distributed as well?

abuccts · 2025-07-28T23:53:16Z

+    torch.save(mean_result, args.output_mean_file)
+    torch.save(std_result, args.output_std_file)


Why not return the value directly?
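
The questions about `main` and about returning values point at a common pattern: keep the computation in a function that returns its result, and let a thin `main` handle arguments and file output. The sketch below illustrates that split on plain floats; the names `compute_mean_and_std`, `--values`, and `--output-file` are assumptions, and it writes JSON rather than using `torch.save` as the PR does.

```python
import argparse
import json
import statistics
from typing import List, Optional, Sequence, Tuple


def compute_mean_and_std(trials: Sequence[float]) -> Tuple[float, float]:
    """Return (mean, population std) so callers and tests can use the values directly."""
    return statistics.fmean(trials), statistics.pstdev(trials)


def main(argv: Optional[List[str]] = None) -> dict:
    parser = argparse.ArgumentParser(description="Mean/std of trial values (sketch).")
    parser.add_argument("--values", type=float, nargs="+", required=True)
    parser.add_argument("--output-file", default=None)
    args = parser.parse_args(argv)

    mean, std = compute_mean_and_std(args.values)
    result = {"mean": mean, "std": std}
    # Writing to disk is optional; the result is returned either way.
    if args.output_file:
        with open(args.output_file, "w") as f:
            json.dump(result, f, indent=2)
    return result


if __name__ == "__main__":
    main()
```

With this layout, other test code can import `compute_mean_and_std` directly, while the `if __name__ == "__main__":` guard keeps the script usable from the command line.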

github-actions · 2025-09-27T18:22:25Z

Marking as stale. No activity in 60 days.

add numerical test utils

023514c

yzygitzh requested a review from a team as a code owner July 26, 2025 07:59

yzygitzh added the CI/CD label Jul 26, 2025

yzygitzh changed the title ~~Test - Add basic utilities for module-level numerical tests~~ Tests - Add basic utilities for module-level numerical tests Jul 26, 2025

yzygitzh requested a review from Copilot July 28, 2025 03:11

Copilot AI reviewed Jul 28, 2025

Comment thread tests/numerical_tests/utils/module_similarity.py Outdated

Comment thread tests/numerical_tests/utils/module_similarity.py

Comment thread tests/numerical_tests/utils/module_similarity.py

Update tests/numerical_tests/utils/module_similarity.py

5c9a03b

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

cp5555 reviewed Jul 28, 2025

cp5555 approved these changes Jul 28, 2025

cp5555 requested a review from abuccts July 28, 2025 21:58

abuccts reviewed Jul 28, 2025

github-actions Bot added the stale label Sep 27, 2025

yzygitzh wants to merge 2 commits into dev from ziyue/pr-numerical-test-base

4 participants
