Benchmarking external tests #12441

cameel · 2021-12-20T19:57:51Z

~~Depends on #12440.~~ Merged.
~~Depends on #12532.~~ Merged.
~~Depends on #12629.~~ Merged.

This PR adds a mechanism for gathering a set of metrics from all external tests and combining them into an easily diffable JSON report.

Currently it gathers the following information:

Gas usage as reported by the eth-gas-reporter plugin for Hardhat and Truffle. The plugin watches all function calls and contract deployments during test execution, measures used gas and presents results in an RST table.
Bytecode size extracted from Truffle and Hardhat artifacts with jq.

The biggest part (report creation) is already implemented but there are still a few small things to iron out:

Make sure the plugin works and actually produces reports in all external projects.
- ~~Colony~~ (only runs on nigthly)
- ~~Gnosis Safe~~ (will work after Run GnosisSafe external tests with Hardhat and directly on upstream #12195)
- OpenZeppelin
- ENS
Skipping the report in compile-only runs.
Attaching reports as artifacts in CI.
Collector job to gather all artifacts and combine them into a single JSON file.
Simple command to process the combined JSON report into a short summary containing only totals.

cameel · 2021-12-22T23:02:57Z

This is pretty much done now.

Benchmark results for Gnosis are still incomplete, probably because its tests are not based on mocha but they will work after the switch to Hardhat in #12195 so there's no point fixing that here. For Colony I'm not sure if they're complete but it only runs on nightly so I'm leaving it as is for now. We should update it to the latest version first anyway.

When reviewing check the artifacts of the c_ext_benchmarks job. This is the primary output we'll use for benchmarking.

cameel · 2022-01-21T14:45:33Z

Rebased on the Uniswap PR (#12532) since that's likely to get merged soon and that way I can already include it here..

cameel · 2022-02-04T18:30:21Z

I updated the code for new external tests. This should now pass CI.

I also fixed an issue with benchmark results being missing for PRBMath and Bleeps. It looks like running tests via mocha suppresses the output from hardhat-gas-reporter. I had to change it so that tests are executed directly with hardhat test.

Unfortunately results seem incomplete in both cases. In Bleeps the table has no deployment costs while in PRBMath method call costs are missing. I'm not sure why so I asked for help in PaulRBerg/prb-math#70.

In any case, I think we should merge it even in the current state. Benchmark results are just extra information and jobs are on purpose designed in such a way that not having them does not make them fail.

cameel · 2022-02-04T22:11:36Z

Regarding no calls in PRBMath - mystery solved. All the methods called in tests are pure or view so there's no actual transaction happening and and eth_call does not report gas usage.

There are plans to make hardhat-gas-reporter inject eth_estimageGas instead and once that happens, we should automatically start seeing gas values for these methods in our benchmarks.

cameel · 2022-02-04T22:43:55Z

As for Bleeps, the issue is caused by the use of hardhat-deploy plugin. It deploys the contracts before tests are executed and hardhat-gas-reporter does not monitor these calls. The project actually has a separate command to report deployment gas. There's an issue to fix that in hardhat-gas-reporter though: cgewecke/hardhat-gas-reporter#86. So this is another thing that should just work once it's fixed upstream.

ekpyron

I vote for just merging this and seeing how it performs.
I by far have not done a complete line-by-line review, but since
(1) all of this is purely informative and non-critical, yet also (2) the information
will be quite helpful and useful, I think it may be reasonable to merge
and see how well it works without a full rigorous review.

cameel added testing 🔨 has dependencies The PR depends on other PRs that must be merged first labels Dec 20, 2021

cameel self-assigned this Dec 20, 2021

cameel marked this pull request as draft December 20, 2021 21:02

cameel force-pushed the preset-selection-in-ext-tests branch from b1ce656 to 37f2c54 Compare December 21, 2021 13:13

cameel force-pushed the benchmarking-ext-tests branch from 9bc1272 to 2258a08 Compare December 21, 2021 14:32

cameel force-pushed the preset-selection-in-ext-tests branch from 37f2c54 to e22f0cf Compare December 22, 2021 12:06

cameel force-pushed the benchmarking-ext-tests branch 2 times, most recently from 7a5c835 to 20c256d Compare December 22, 2021 12:36

cameel force-pushed the preset-selection-in-ext-tests branch from e22f0cf to c40a44f Compare December 22, 2021 15:41

cameel force-pushed the benchmarking-ext-tests branch 8 times, most recently from 9280182 to 7864565 Compare December 22, 2021 22:56

cameel marked this pull request as ready for review December 22, 2021 23:03

cameel mentioned this pull request Jan 10, 2022

External tests for sushiswap/trident #12197

Merged

cameel force-pushed the preset-selection-in-ext-tests branch from c40a44f to 1928b78 Compare January 10, 2022 13:44

cameel force-pushed the benchmarking-ext-tests branch from 7864565 to 39b6206 Compare January 10, 2022 13:44

Base automatically changed from preset-selection-in-ext-tests to develop January 10, 2022 20:15

cameel force-pushed the benchmarking-ext-tests branch from 39b6206 to 4806603 Compare January 11, 2022 12:31

cameel mentioned this pull request Jan 12, 2022

External test benchmarking extensions #12522

Closed

10 tasks

cameel removed the has dependencies The PR depends on other PRs that must be merged first label Jan 18, 2022

cameel marked this pull request as draft January 18, 2022 00:41

cameel force-pushed the benchmarking-ext-tests branch 2 times, most recently from 14ed1af to 4c76add Compare January 19, 2022 19:07

cameel force-pushed the benchmarking-ext-tests branch from fad1902 to 7b943d4 Compare January 21, 2022 15:11

cameel force-pushed the uniswap-ext-test branch from 1a6c62e to aeb9637 Compare January 21, 2022 15:13

cameel mentioned this pull request Jan 21, 2022

Force-enable stack-to-memory to see how it affects external tests. #12571

Closed

Base automatically changed from uniswap-ext-test to develop January 21, 2022 21:03

cameel removed the has dependencies The PR depends on other PRs that must be merged first label Jan 21, 2022

cameel force-pushed the benchmarking-ext-tests branch 2 times, most recently from f765830 to 0ee4835 Compare January 24, 2022 13:04

cameel force-pushed the benchmarking-ext-tests branch from 0ee4835 to 741404b Compare February 4, 2022 14:12

cameel mentioned this pull request Feb 4, 2022

Re-enable Bleeps external test without the failing governor test #12629

Merged

cameel force-pushed the benchmarking-ext-tests branch from 741404b to 55a7ca5 Compare February 4, 2022 14:23

cameel added the has dependencies The PR depends on other PRs that must be merged first label Feb 4, 2022

cameel changed the base branch from develop to reenable-bleeps-ext-test-without-governor-test February 4, 2022 14:24

Base automatically changed from reenable-bleeps-ext-test-without-governor-test to develop February 4, 2022 14:51

cameel removed the has dependencies The PR depends on other PRs that must be merged first label Feb 4, 2022

cameel force-pushed the benchmarking-ext-tests branch from d9db164 to 77afad2 Compare February 4, 2022 18:23

cameel added 5 commits February 9, 2022 17:02

CI: Fix job name for PRBMath external test

d511fe9

externalTests: Clean the build/ dir for Hardhat too

3e1aee1

externalTests: Make comments about failing presets less terse

7fc2253

Python script for parsing eth-gas-reporter output

a7852cb

externalTests: Benchmark reports

c6094bb

cameel force-pushed the benchmarking-ext-tests branch from 77afad2 to 26afbf3 Compare February 9, 2022 16:02

Benchmark report collector job + summary

60d9aa0

cameel force-pushed the benchmarking-ext-tests branch from 26afbf3 to 60d9aa0 Compare February 9, 2022 16:54

ekpyron approved these changes Feb 14, 2022

View reviewed changes

leonardoalt merged commit 947a599 into develop Feb 14, 2022

leonardoalt deleted the benchmarking-ext-tests branch February 14, 2022 19:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmarking external tests #12441

Benchmarking external tests #12441

cameel commented Dec 20, 2021 •

edited

Loading

cameel commented Dec 22, 2021 •

edited

Loading

cameel commented Jan 21, 2022

cameel commented Feb 4, 2022

cameel commented Feb 4, 2022

cameel commented Feb 4, 2022

ekpyron left a comment

Benchmarking external tests #12441

Benchmarking external tests #12441

Conversation

cameel commented Dec 20, 2021 • edited Loading

cameel commented Dec 22, 2021 • edited Loading

cameel commented Jan 21, 2022

cameel commented Feb 4, 2022

cameel commented Feb 4, 2022

cameel commented Feb 4, 2022

ekpyron left a comment

Choose a reason for hiding this comment

cameel commented Dec 20, 2021 •

edited

Loading

cameel commented Dec 22, 2021 •

edited

Loading