Add OnnxDiscrepancyCheck speedup metric with default timing updates by xadupre · Pull Request #2502 · microsoft/Olive

xadupre · 2026-06-05T15:50:20Z

Describe your changes

Added speedup measurement for OnnxDiscrepancyCheck and updated behavior based on review feedback:

Changed timing_iterations default from 10 to 5.
If timing_iterations is set to 0, speedup measurement is skipped.
Added unit tests to validate the new default and the skip behavior.
Fixed test mocks to properly configure device attribute for compare_generation tests.

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Copilot

Pull request overview

This PR enhances the OnnxDiscrepancyCheck pass by adding an inference speedup measurement (ONNX vs PyTorch) and introducing configurable warmup/timing iteration settings, with updated defaults and tests to validate the new behavior.

Changes:

Added warmup_iterations and timing_iterations config parameters (defaulting timing_iterations to 5) and implemented speedup measurement with an option to skip when timing_iterations=0.
Updated session/device setup to target the configured accelerator (with CPU fallback) and run PyTorch on the matched torch device.
Added unit tests to validate the new default and the skip behavior.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`olive/passes/onnx/discrepancy_check.py`	Adds speedup timing, new config params/defaults, and device-aware session/model handling.
`test/passes/onnx/test_discrepancy_check.py`	Adds tests for the new `timing_iterations` default and the “skip when 0” behavior.

Copilot · 2026-06-11T17:50:32Z

+        return speedup
+
    def compare_generation(self, config: type[BasePassConfig], ref_model) -> int:


This is already addressed in commit 595bffc. input_ids is moved to ref_model.device (line 389) and the output is moved back to CPU via .cpu() before .tolist() (line 398).

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

xadupre · 2026-06-11T17:45:59Z

@copilot test/passes/onnx/test_discrepancy_check.py:105: in test_compare_generation_returns_common_prefix_length
result = pass_instance.compare_generation(config, mock_ref_model)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
olive/passes/onnx/discrepancy_check.py:389: in compare_generation
input_ids = input_ids.to(ref_model.device)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
E TypeError: to() received an invalid combination of arguments - got (MagicMock), but expected one of:
E * (torch.device device = None, torch.dtype dtype = None, bool non_blocking = False, bool copy = False, *, torch.memory_format memory_format = None)
E * (torch.dtype dtype, bool non_blocking = False, bool copy = False, *, torch.memory_format memory_format = None)
E * (Tensor tensor, bool non_blocking = False, bool copy = False, *, torch.memory_format memory_format = None)

Copilot · 2026-06-11T17:58:53Z

@copilot test/passes/onnx/test_discrepancy_check.py:105: in test_compare_generation_returns_common_prefix_length
result = pass_instance.compare_generation(config, mock_ref_model)
^^...

Fixed in 25c6ab4. Added mock_ref_model.device = torch.device("cpu") to both compare_generation test cases so that the mock returns a proper torch.device object instead of a MagicMock.

add speedup to OnnxDiscrepancyCheck

7713d07

xadupre commented Jun 5, 2026

View reviewed changes

Comment thread olive/passes/onnx/discrepancy_check.py

Copilot started work on behalf of xadupre June 5, 2026 17:15 View session

Fix discrepancy speedup defaults and zero-iteration skip

510a86a

Copilot AI changed the title ~~add speedup to OnnxDiscrepancyCheck~~ Add OnnxDiscrepancyCheck speedup metric with default timing updates Jun 5, 2026

Copilot finished work on behalf of xadupre June 5, 2026 17:25

xadupre marked this pull request as ready for review June 5, 2026 17:28

Copilot AI review requested due to automatic review settings June 5, 2026 17:28

Copilot started reviewing on behalf of xadupre June 5, 2026 17:28 View session

github-advanced-security AI found potential problems Jun 5, 2026

View reviewed changes

Comment thread test/passes/onnx/test_discrepancy_check.py Fixed

Comment thread test/passes/onnx/test_discrepancy_check.py Fixed

Copilot AI reviewed Jun 5, 2026

View reviewed changes

Copilot started work on behalf of xiaoyu-work June 5, 2026 20:40 View session

Suppress protected-access pylint warnings in discrepancy check tests

3a43a02

Copilot finished work on behalf of xiaoyu-work June 5, 2026 20:47

Copilot AI requested a review from xiaoyu-work June 5, 2026 20:47

Potential fix for pull request finding

454d9a9

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot started work on behalf of xadupre June 8, 2026 12:43 View session

xadupre and others added 2 commits June 8, 2026 14:43

Merge branch 'main' into xadupre/speedup

a1e7586

Fix generation tensor device handling

595bffc

Copilot finished work on behalf of xadupre June 8, 2026 12:50

Merge branch 'main' into xadupre/speedup

5629eda

Copilot started work on behalf of xadupre June 11, 2026 17:45 View session

Copilot finished work on behalf of xadupre June 11, 2026 17:50

Copilot started work on behalf of xadupre June 11, 2026 17:51 View session

Fix mock_ref_model.device in compare_generation tests

25c6ab4

Copilot finished work on behalf of xadupre June 11, 2026 17:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add OnnxDiscrepancyCheck speedup metric with default timing updates#2502

Add OnnxDiscrepancyCheck speedup metric with default timing updates#2502
xadupre wants to merge 8 commits into
mainfrom
xadupre/speedup

xadupre commented Jun 5, 2026 •

edited by Copilot AI

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Jun 11, 2026

Uh oh!

xadupre commented Jun 11, 2026

Uh oh!

Copilot AI commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		return speedup

		def compare_generation(self, config: type[BasePassConfig], ref_model) -> int:

Conversation

xadupre commented Jun 5, 2026 • edited by Copilot AI Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes

Checklist before requesting a review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

xadupre commented Jun 11, 2026

Uh oh!

Copilot AI commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

xadupre commented Jun 5, 2026 •

edited by Copilot AI

Loading