Benchmark and optimize analyze call #2033

Closed
2 tasks done
webmaster128 opened this issue Feb 27, 2024 · 3 comments · Fixed by #2051

Comments

webmaster128 (Member) commented Feb 27, 2024

During manual testing I realized that analyze can be a bit slow, depending on the contract. This might be due to a heavy ParsedWasm::parse or maybe even due to loading the whole Wasm from disk at once.

In CosmWasm/wasmd#1813 we start exploring first steps towards mass migrations. In order to ensure MsgMigrateContract is performant (dozens, maybe hundreds of migrations in a block), we also need to ensure that the Analyze call is fast.

  • Create benchmark for analyze for different contracts
  • Evaluate and implement performance optimizations
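
For illustration, a minimal Criterion sketch of what such a benchmark could look like; the `analyze_contract` helper and the testdata paths are placeholders for the real entry point (something like Cache::analyze in cosmwasm-vm), and the group/function names mirror the output reported later in this issue:

```rust
use criterion::{black_box, criterion_group, criterion_main, Criterion};

// Placeholder for the real entry point (e.g. Cache::analyze in cosmwasm-vm).
fn analyze_contract(wasm: &[u8]) {
    black_box(wasm);
}

fn bench_analyze(c: &mut Criterion) {
    // Group/function names follow the "Cache/analyze_<contract>.wasm" pattern.
    let mut group = c.benchmark_group("Cache");
    for name in ["hackatom.wasm", "cyberpunk.wasm", "floaty.wasm"] {
        let wasm = std::fs::read(format!("testdata/{name}")).expect("read contract");
        group.bench_function(format!("analyze_{name}"), |b| {
            b.iter(|| analyze_contract(black_box(&wasm)))
        });
    }
    group.finish();
}

criterion_group!(benches, bench_analyze);
criterion_main!(benches);
```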
aumetra (Member) commented Mar 12, 2024

After adding some more of the testdata contracts to the benchmark harness and running it through cargo-flamegraph, it seems like it spends most of its time inside wasmparser::validator::func::FuncValidator.

In there it steps through the function body byte-by-byte and validates that it's well-formed. Makes sense that we have that step. Since that is part of Bytecode Alliance's wasmparser, I'm not sure how much we could actually optimize this on our side.
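
For context, full validation with recent wasmparser versions looks roughly like the sketch below (a sketch only, API details vary between wasmparser releases); the per-body validate call is the hot path the flamegraph points at:

```rust
use wasmparser::{Parser, ValidPayload, Validator};

// Sketch of full-module validation, roughly as wasmparser's own examples do it.
fn validate_module(wasm: &[u8]) -> wasmparser::Result<()> {
    let mut validator = Validator::new();
    for payload in Parser::new(0).parse_all(wasm) {
        // Section-level checks are cheap; the expensive part is below.
        if let ValidPayload::Func(func, body) = validator.payload(&payload?)? {
            // Hot path from the flamegraph: the function body is decoded and
            // type-checked operator by operator.
            let mut func_validator = func.into_validator(Default::default());
            func_validator.validate(&body)?;
        }
    }
    Ok(())
}
```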


Some bench examples when removing the validation of the function body:

Cache/analyze_cyberpunk.wasm
                        time:   [168.58 µs 168.66 µs 168.80 µs]
                        change: [-83.658% -83.618% -83.580%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 1 outliers among 12 measurements (8.33%)
  1 (8.33%) high mild
Cache/analyze_cyberpunk_rust170.wasm
                        time:   [157.20 µs 157.61 µs 158.03 µs]
                        change: [-79.398% -79.357% -79.303%] (p = 0.00 < 0.05)
                        Performance has improved.
Cache/analyze_floaty.wasm
                        time:   [117.18 µs 117.26 µs 117.37 µs]
                        change: [-82.821% -82.791% -82.763%] (p = 0.00 < 0.05)
                        Performance has improved.
Cache/analyze_hackatom.wasm
                        time:   [124.91 µs 125.00 µs 125.11 µs]
                        change: [-83.251% -83.200% -83.159%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 1 outliers among 12 measurements (8.33%)
  1 (8.33%) high mild

As apparent from the benches, we see an ~80% improvement in performance across the board and are comfortably entering the low three-digit microsecond territory.

webmaster128 (Member, Author) commented:

> In there it steps through the function body byte-by-byte and validates that it's well-formed. Makes sense that we have that step. Since that is part of Bytecode Alliance's wasmparser, I'm not sure how much we could actually optimize this on our side.

Something we can do to improve this is to pull the function validation out into a separate step (a rough sketch follows the list). Then:

  1. In fn check_wasm we stop execution early on error (if any) and only perform the function validation if everything else is ok
  2. In fn analyze we don't need to do the function validation at all, since the Wasm file was checked before and we don't use that extra data
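
A rough sketch of that split, assuming recent wasmparser APIs; the parse / validate_funcs names and the DeferredFuncs struct are illustrative, not the actual cosmwasm-vm code:

```rust
use wasmparser::{
    FuncToValidate, FuncValidatorAllocations, FunctionBody, Parser, ValidPayload, Validator,
    ValidatorResources,
};

// Illustrative only: module-level validation that merely collects the function
// bodies, plus a separate step that validates them. check_wasm would run both
// steps; analyze would run only the first.
pub struct DeferredFuncs<'a> {
    funcs: Vec<(FuncToValidate<ValidatorResources>, FunctionBody<'a>)>,
}

pub fn parse(wasm: &[u8]) -> wasmparser::Result<DeferredFuncs<'_>> {
    let mut validator = Validator::new();
    let mut funcs = Vec::new();
    for payload in Parser::new(0).parse_all(wasm) {
        if let ValidPayload::Func(func, body) = validator.payload(&payload?)? {
            // Defer the expensive per-operator validation.
            funcs.push((func, body));
        }
    }
    Ok(DeferredFuncs { funcs })
}

pub fn validate_funcs(deferred: DeferredFuncs<'_>) -> wasmparser::Result<()> {
    // Reuse the validator's scratch allocations across function bodies.
    let mut allocs = FuncValidatorAllocations::default();
    for (func, body) in deferred.funcs {
        let mut fv = func.into_validator(allocs);
        fv.validate(&body)?;
        allocs = fv.into_allocations();
    }
    Ok(())
}
```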

aumetra (Member) commented Mar 12, 2024

Makes sense, will look into that.

I also managed to get some performance improvements (averaging ~22%) by parallelising the function body validation (without reusing the allocations from the validator, since that would add locking overhead).
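
A minimal sketch of that kind of parallel validation, assuming rayon and the deferred-validation split sketched above; each task builds its own FuncValidatorAllocations instead of sharing a pool behind a lock:

```rust
use rayon::prelude::*;
use wasmparser::{FuncToValidate, FunctionBody, ValidatorResources};

// Validate the collected function bodies in parallel. Every task creates its
// own FuncValidatorAllocations (via Default) rather than sharing one, trading
// some extra allocations for lock-free parallelism.
pub fn validate_funcs_parallel(
    funcs: Vec<(FuncToValidate<ValidatorResources>, FunctionBody<'_>)>,
) -> wasmparser::Result<()> {
    funcs.into_par_iter().try_for_each(|(func, body)| {
        let mut fv = func.into_validator(Default::default());
        fv.validate(&body)
    })
}
```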
