Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Measure the accuracy of log classifier ruleset #5826

Open
huydhn opened this issue Oct 25, 2024 · 0 comments
Open

Measure the accuracy of log classifier ruleset #5826

huydhn opened this issue Oct 25, 2024 · 0 comments

Comments

@huydhn
Copy link
Contributor

huydhn commented Oct 25, 2024

Log classifier is the tool that PyTorch CI is using to extract relevant failures from the log. It depends on a rule set defined at https://github.com/pytorch/test-infra/blob/main/aws/lambda/log-classifier/ruleset.toml in which the rules are evaluated from top to bottom in a first-match basis. The first line in the logs that matches one of the rule is consider the relevant error.

The last rule in the list is the catch-all GHA error, when it matches, it means that the log classifier has failed to find the relevant failures. So, the log classifier accuracy is 1 - % of catch-all GHA error.

Notes to bootcampers:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Cold Storage
Development

No branches or pull requests

1 participant