Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix unbounded vec deserialization #592

Closed
wants to merge 1 commit into from

Conversation

HarukaMa
Copy link
Contributor

@HarukaMa HarukaMa commented Jan 16, 2022

Motivation

The Vec deserialization procedure of CanonicalDeserialize didn't check if the vector size is reasonable which causes https://github.com/AleoHQ/snarkOS/issues/1534.

This PR limits the data size to 1GB without counting Vec overhead which should be enough (?).

Edit: looks like Transactions, Transition and Event are safe.

Test Plan

Currently the tests of CanonicalSerialize and CanonicalDeserialize only checks if the data is intact after a se/des cycle so didn't add any test there.

I'm using a fake client to send bad proof data to test it. It seems there is no significant memory usage changes when I rapidly trigger a fairly large allocation (several hundred MBs).

Related PRs

(Link your related PRs here)

@ljedrz
Copy link
Collaborator

ljedrz commented Jan 16, 2022

Thanks for the contribution! Does this fix the issue for any hand-crafted message you tested? I have a branch with panic catches around calls to Vec::with_capacity, but this spot (which was also my first target) wasn't sufficient for the operator to no longer crash - I'll share it once I'm at the PC.

@HarukaMa
Copy link
Contributor Author

I've tried with different vec sizes and when the size exceeds 1GB it will directly err out without crashing.

I guess you can't catch OOM errors on allocations as it doesn't print backtraces even with the env param. I guess the best way is to not try to alloc that large memory.

@ljedrz
Copy link
Collaborator

ljedrz commented Jan 16, 2022

Agreed, a fixed boundary is the safest approach and this change looks fine; I'll leave the decision regarding the maximum size to @howardwu.

@howardwu
Copy link
Member

howardwu commented Jan 17, 2022

Are we able to expand the maximum capacity beyond 1GB? While 1GB may seem large, it is small for our universal setup ceremony, which would need at least 10GB (if not more) for this data structure (IIRC).

Can you clarify what case is introducing a failure at 1GB in snarkOS?

@HarukaMa
Copy link
Contributor Author

HarukaMa commented Jan 17, 2022

@howardwu In the serialized proof, the size of each Vec is directly encoded in the message. Attacker could then manually craft a "proof" with some absurd Vec size like 1 trillion. Calling Vec::with_capacity with that size would result in trying to allocate a very large chunk of memory. Rust would simply abort on failed allocation (rust-lang/rust#29802).

There might still be different ways to handle this if imposing size limit is not that easy:

  • Use special deserialization on proofs, instead of using the generalized CanonicalDeserialize (like Transactions, which limits the Vec size to u16 thus not affected);
  • Don't pre-allocate memory and use with_capacity(0) instead, which might be slower due to required reallocs;
  • Make a custom allocator to handle situations like this.

Still, being able to accept arbitrary input from network with an unchecked size is not safe and should be carefully considered.

@ljedrz
Copy link
Collaborator

ljedrz commented Jan 17, 2022

Don't pre-allocate memory and use with_capacity(0) instead, which might be slower due to required reallocs

This will work, as long as we still do expect to read a length (just not pre-allocate based on it). We could also use something like Vec::try_reserve (which would bump the MSRV to 1.57).

Use special deserialization on proofs (...)

This would probably work best, even if it requires some tinkering around the overall serialization setup.

@Pratyush
Copy link
Contributor

Special deserialization for proofs makes sense to me, as the number of elements in the (commitment) vec is known statically: https://github.com/AleoHQ/snarkVM/blob/435f1120b15d0d63944b9935667b607084b83cef/marlin/src/ahp/ahp.rs#L63. We can do a similar analysis for the other collections used in the proof, like those used for evaluations.

@HarukaMa
Copy link
Contributor Author

Considering the proof is always 771 bytes on network, I think the whole structure is statically known.

Feel free to supersede this PR as I'm not sure how to correctly make a special deserializer for the proof. I'd recommend to put it on high priority though, as currently it's still possible to crash every node on the network with a specially crafted payload.

@ljedrz
Copy link
Collaborator

ljedrz commented Jan 21, 2022

Until proof-specific deserialization is implemented, I proposed a more generic solution in https://github.com/AleoHQ/snarkVM/pull/609.

@ljedrz ljedrz mentioned this pull request Apr 3, 2022
@howardwu
Copy link
Member

howardwu commented Apr 3, 2022

Closing as the issue has been addressed in #735

@howardwu howardwu closed this Apr 3, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants