[benchmarks] Add docs file. #8023

ysiraichi · 2024-09-16T16:53:37Z

This PR introduces a docs/torchbench.md file. It explains how to use the benchmarking scripts for running, troubleshooting, and debugging the models inside Torchbench.

miladm · 2024-09-16T16:54:38Z

docs/torchbench.md

+
+**PyTorch/XLA Metrics:** (repeat-specific) the flag `--dump-pytorch-xla-metrics` creates a
+new file, dumping PyTorch/XLA metrics, such as graph compiling and execution information.
+


Can we have a section on the nightly CI runs for TPU and GPU?

Sure. But I don't really have any information on those nightly CI runs. What should it contain?

@zpcore can you please help Yukio with this section?

Additionally: ideally, we also want to run the verifier as part of our tests.

miladm · 2024-09-16T16:57:02Z

docs/torchbench.md

+
+**PyTorch/XLA Metrics:** (repeat-specific) the flag `--dump-pytorch-xla-metrics` creates a
+new file, dumping PyTorch/XLA metrics, such as graph compiling and execution information.
+


Can we include a troubleshooting section? It should cover the pitfalls we ran into as well as approaches for debugging the benchmarks efficiently.

@miladm I don't think it is appropriate to document any resolved issue here. If there are any features that can be used to debug different performance issues we should absolutely make sure they are captured.

miladm · 2024-09-23T17:02:05Z

@ysiraichi

Can we have a section called "tips and tricks" covering some of the ways we found to improve torch_xla performance? This can include flags such as functionalization, CUDA fallback, CUDA graphs, etc.

Started writing doc.

8fe806b

miladm reviewed Sep 16, 2024

ysiraichi added 9 commits September 17, 2024 15:52

Add more information.

657223e

Add verification_code information.

d278516

Briefly mention the output format.

bedb72d

Add a small guide for running the verifier.

45718c3

Mention --repeat and --iterations-per-run parameters.

cf35b0e

Add other troubleshooting points.

6a635fe

Add chrome-tracing image.

04c2874

Fix session links.

f425aaa

Add information about the YAML configuration file.

acfe3e3

ysiraichi mentioned this pull request Sep 23, 2024

Failing Torchbench Models: tracking issue #5932

Open

ysiraichi marked this pull request as ready for review September 23, 2024 17:00

ysiraichi mentioned this pull request Sep 25, 2024

Run the verifier on Torchbench in the Nightly CI. #8070

Open

ysiraichi added 3 commits September 25, 2024 16:34

Add tips and tricks section.

cfc7165

Add more tips.

647510f

Add nightly CI section.

b4c3f70

qihqi requested review from miladm and amjames October 31, 2024 16:27


