fix: disallow leading zeros in RLP decoding #335

Rjected · 2023-10-19T18:27:13Z

Motivation

Currently Uint RLP decoding ignores leading zeros.

Solution

Disallow leading zeros.

PR Checklist

Added Tests
Added Documentation
Updated the changelog

prestwich · 2023-10-19T19:43:28Z

This would be a breaking behavior change. Is there a motivating reason?

codecov · 2023-10-19T19:50:18Z

Codecov Report

Attention: 1 lines in your changes are missing coverage. Please review.

Comparison is base (8089eab) 80.48% compared to head (867476a) 80.46%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #335      +/-   ##
==========================================
- Coverage   80.48%   80.46%   -0.03%     
==========================================
  Files          54       54              
  Lines        6140     6179      +39     
==========================================
+ Hits         4942     4972      +30     
- Misses       1198     1207       +9

Files	Coverage Δ
src/support/alloy_rlp.rs	`100.00% <100.00%> (ø)`
src/support/fastrlp.rs	`98.07% <92.85%> (-0.82%)`	⬇️

... and 4 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Rjected · 2023-10-19T19:57:01Z

This would be a breaking behavior change. Is there a motivating reason?

Sorry, should have been more specific in the PR description. The main motivation is that leading zeros on RLP uints is defined to be invalid, and encoding would also never output uints encoded this way. This also makes writing certain kinds of fuzz tests possible, specifically fuzz tests that:

Take fuzz data as input
Decode TX or other struct with an int
if successful, encode struct
compare encoding output to bytes input

If we accept invalid RLP input, and are more permissive in decoding than encoding as a result, these fuzz tests quickly fail and do not act as meaningful fuzz tests.

Rjected · 2023-10-27T17:21:49Z

@prestwich just bumping, any thoughts on this given the context?

prestwich · 2023-10-28T00:32:42Z

The ethereum.org spec is clear as mud, but the yellowpaper appears to be unambiguous about this (see (197)) . Accepting the non-minimal uint is off-spec, so we should fix it. However, because this silently changes behavior in a way that breaks downstream users, we should release it as a major version bump

prestwich

Update the changelog :)

recmo

LGTM. Bug fix. Not a breaking change.

src/support/alloy_rlp.rs

recmo · 2023-10-28T11:52:47Z

src/support/alloy_rlp.rs

+        // leading zeros
+        if !bytes.is_empty() && bytes[0] == 0 {
+            return Err(Error::LeadingZero);
+        }


RLP spec explicitly says

integers must be represented in big-endian binary form with no leading zeroes (thus making the integer value zero equivalent to the empty byte array).

meaning that the old accepting behaviour was not according to spec and a bug. Fixing bugs is not a breaking change.

There's 'robustness principle' argument that could be made to make this error optionally a warning (i.e. introduce a non-strict mode). But that seems more effort than it's worth. Unless someone volunteers the strict mode should be default.

recmo · 2023-10-28T12:05:28Z

If we accept invalid RLP input, and are more permissive in decoding than encoding as a result, these fuzz tests quickly fail and do not act as meaningful fuzz tests.

This proposed test assumes that encoding is canonical. This assumption is only true when leading zeros are rejected.

[ethereum.org] and [yellow paper]

Both yellow paper and ethereum.org spec clearly state leading zeros are an error. While neither mention it explicitly, the implicit goal is to make it such that there is only one canonical RLP encoding. This is important since it RLP encoding is often used to hash data structures. Having multiple valid hashes for the same data structure leads to all forms of madness.

recmo · 2023-10-28T12:07:21Z

However, because this silently changes behavior in a way that breaks downstream users, we should release it as a major version bump

The accepting behavior was a bug, any downstream users depending on this bug are in error (as is clear by the RLP spec). IMO this a bug fix and not a breaking change.

prestwich · 2023-10-28T18:46:05Z

I've also submitted a PR to the RLP spec doc on the Ethereum website to disambiguate this

the worst part of this is how it breaks the layer boundary between RLP and "higher-order" protocols. You cannot accurately decode RLP without knowing whether the higher-order protocol contains positive integers

ethereum/ethereum-org-website#11532

Rjected · 2023-10-30T15:01:19Z

Updated the changelog and made other requested changes, @prestwich mind reviewing again?

DaniPopes

Can you also update fastrlp? It should be an easy copy-paste

Rjected · 2023-10-30T15:30:17Z

Can you also update fastrlp? It should be an easy copy-paste

good catch, will do

prestwich

one more changelog nit

CHANGELOG.md

prestwich

🙌

Rjected added 3 commits October 19, 2023 14:23

add tests for uint leading zeros

d19a360

todo for the checks

a3c262c

implement leading zero check

5fe0d22

Rjected marked this pull request as ready for review October 19, 2023 20:03

Rjected requested a review from prestwich as a code owner October 19, 2023 20:03

This was referenced Oct 23, 2023

feat: roundtrip fuzz harness for PooledTransactions paradigmxyz/reth#5125

Merged

fix: check payload length and consumed buf for pooled tx paradigmxyz/reth#5153

Merged

Merge branch 'main' into fix-leading-zero-rlp

4dcf375

prestwich requested changes Oct 28, 2023

View reviewed changes

recmo approved these changes Oct 28, 2023

View reviewed changes

Rjected added 2 commits October 29, 2023 13:19

update changelog and clarify comments with spec link

8d3b13b

cargo fmt

901d01d

DaniPopes suggested changes Oct 30, 2023

View reviewed changes

Rjected added 3 commits October 30, 2023 12:57

copy alloy rlp implementation to fastrlp

44cc9b1

fix error

0b6d46e

no decode_bytes for fastrlp header

fca1d73

prestwich requested changes Oct 30, 2023

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

move bugfix into unreleased fixed section

867476a

prestwich approved these changes Oct 31, 2023

View reviewed changes

prestwich merged commit 8229a30 into recmo:main Oct 31, 2023
19 of 20 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: disallow leading zeros in RLP decoding #335

fix: disallow leading zeros in RLP decoding #335

Rjected commented Oct 19, 2023 •

edited

Loading

prestwich commented Oct 19, 2023

codecov bot commented Oct 19, 2023 •

edited

Loading

Rjected commented Oct 19, 2023

Rjected commented Oct 27, 2023

prestwich commented Oct 28, 2023

prestwich left a comment

recmo left a comment

recmo Oct 28, 2023

recmo commented Oct 28, 2023

recmo commented Oct 28, 2023

prestwich commented Oct 28, 2023

Rjected commented Oct 30, 2023

DaniPopes left a comment

Rjected commented Oct 30, 2023

prestwich left a comment

prestwich left a comment

fix: disallow leading zeros in RLP decoding #335

fix: disallow leading zeros in RLP decoding #335

Conversation

Rjected commented Oct 19, 2023 • edited Loading

Motivation

Solution

PR Checklist

prestwich commented Oct 19, 2023

codecov bot commented Oct 19, 2023 • edited Loading

Codecov Report

Rjected commented Oct 19, 2023

Rjected commented Oct 27, 2023

prestwich commented Oct 28, 2023

prestwich left a comment

Choose a reason for hiding this comment

recmo left a comment

Choose a reason for hiding this comment

recmo Oct 28, 2023

Choose a reason for hiding this comment

recmo commented Oct 28, 2023

recmo commented Oct 28, 2023

prestwich commented Oct 28, 2023

Rjected commented Oct 30, 2023

DaniPopes left a comment

Choose a reason for hiding this comment

Rjected commented Oct 30, 2023

prestwich left a comment

Choose a reason for hiding this comment

prestwich left a comment

Choose a reason for hiding this comment

Rjected commented Oct 19, 2023 •

edited

Loading

codecov bot commented Oct 19, 2023 •

edited

Loading