Use unittest XML files to parse PyTorch test results #3633

Open: wants to merge 6 commits into develop
Conversation

@Flamefire Flamefire (Contributor) commented Feb 21, 2025

(created using eb --new-pr)

Requires

Some explanations:

  • As discussed in a PyTorch issue, the only machine-readable output is the set of test XML files, which are only generated on (their) CI
  • The easyblock applies patches that allow enabling test reports by setting an EasyBuild-specific environment variable.
    • There is an option for run_test.py that is supposed to enable that, but it isn't passed on to the subprocess and is hence not reliable
    • Another bug results in test reports not being generated outside CI even when the option is passed
    • As the patched file gets installed, we shouldn't change its (default) behavior in case users run it themselves, hence the environment variable
  • The PyTorch test suite uses Python unittest, pytest and, since 2.3, custom logic to rerun failed tests. This generates XML result files in different formats and potentially with duplicates
    • Successful reruns may be reported alongside their previous failures, but in different files, so "merging" is required to keep only the successful results
    • The implemented parser collects all results and attributes them to their "test suite" (usually the Python file executed, which might include or run other files)
    • Some tests are run multiple times in different configurations, i.e. the same test file is executed multiple times with an environment variable set to choose e.g. the distributed backend. Those need to be treated as separate tests
    • Afterwards all results are combined/merged: a test from the same test suite that is found multiple times is considered successful if at least one of the duplicates was successful
  • I used Python type hints to make the code a bit easier to follow
  • In many places assumptions are verified by raising a descriptive error. This should make it possible to detect changes in PyTorch that affect the logic
  • The "old" (current) parsing of the stdout is still used
    • The new logic is only enabled when the PyTorch easyconfig has the required xmlrunner Python package as a direct or transitive dependency. We have unittest-xml-reporting ECs for that
    • For PyTorch < 2.3 the results found by both parsers are compared and differences are shown in the logfile (since 2.3 the stdout parser isn't really useful anymore). They should match, of course, but in the end the result from the XML files is used
    • The final output of PyTorch's run_test.py contains a list of failed test suites. We match against that, as before, to detect when we missed something.
    • That also detects test suites that failed to start, e.g. due to syntax errors introduced by our patches. In that case no XML file is generated and we would otherwise miss the failure, but we should handle all such cases by fixing the issue or skipping the test
  • I considered verifying the found suites against the list of suites to run as printed by run_test.py, but some of the test files are missing the code required to start the tests and hence show up in that list while producing no output at all
  • The easyblock file can be run directly and accepts:
    • An EasyBuild log file: parses the stdout of run_test as found in the log, to test the old parser. This mode exists already
    • A directory: runs the new (XML) parser on a test-results folder containing the XML reports
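The merge rule described above (a duplicated test counts as passed if at least one of its runs passed, e.g. a successful rerun after a failure) can be sketched roughly like this. This is a minimal illustration, not the easyblock's actual code; the tuple format and all names are invented:

```python
# Sketch of the "at least one duplicate passed" merge rule.
# Results are grouped by (suite, test); a rerun that passed
# overrides an earlier recorded failure of the same test.
from collections import defaultdict
from typing import Dict, Iterable, Tuple

def merge_results(results: Iterable[Tuple[str, str, bool]]) -> Dict[Tuple[str, str], bool]:
    """Merge (suite, test_name, passed) tuples; OR together duplicates."""
    merged: Dict[Tuple[str, str], bool] = defaultdict(bool)
    for suite, test, passed in results:
        merged[(suite, test)] = merged[(suite, test)] or passed
    return dict(merged)

# Invented example data: one failure fixed by a rerun, one persistent failure
runs = [
    ('test_nn', 'test_conv', False),    # initial failure ...
    ('test_nn', 'test_conv', True),     # ... passed on rerun -> counts as pass
    ('test_jit', 'test_trace', False),  # failed in every run -> failure
]
merged = merge_results(runs)
failed = sorted(key for key, ok in merged.items() if not ok)
print(failed)  # [('test_jit', 'test_trace')]
```

Because the merge is a plain OR over duplicates, it is order-independent, which matters since the rerun results arrive in separate XML files.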
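The dual-mode invocation (log file vs. directory) could be dispatched roughly as follows; the function name and return values are invented for illustration and do not reflect the easyblock's real interface:

```python
# Sketch of dispatching on the CLI argument: a directory selects the
# new XML parser, an existing file selects the old stdout/log parser.
import os
import sys

def select_parser(path: str) -> str:
    """Return which (hypothetical) parser mode would handle the argument."""
    if os.path.isdir(path):
        return 'xml'     # directory: parse the XML reports it contains
    if os.path.isfile(path):
        return 'stdout'  # log file: parse the run_test stdout from the log
    raise ValueError('no such file or directory: %s' % path)

if __name__ == '__main__' and len(sys.argv) > 1:
    print(select_parser(sys.argv[1]))
```

Keeping both entry points in the same file makes it easy to compare the two parsers on the same test run.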

I prepared PRs for new and old PyTorch ECs to include the dependency required for the XML reporting. Those can be used to test this PR:
