add autonat v2 spec #538

sukunrt · 2023-04-11T07:43:17Z

First draft for autonat v2. #503

This protocol allows for testing reachability on exactly one address. This helps determine reachability at an address level. This also simplifies the protocol a lot.

I'll change the spec to reflect the discussion on dialing an IP address different from the node's observed IP address: #536

Discussion for nonce in message is here: libp2p/go-libp2p#1480
and this comment in particular libp2p/go-libp2p#1480 (comment)

marten-seemann

Very nice, this is a solid starting point for the spec!

What's the plan for resolving #536? Would you open a new PR that targets this PR here?

autonat/README.md

autonat/autonat-v2.md

sukunrt · 2023-04-11T12:23:48Z

> What's the plan for resolving #536? Would you open a new PR that targets this PR here?

Yes, I'll open a PR with the changes for #536.

autonat/autonat-v2.md

thomaseizinger

Exciting! Thanks for your work. Left some comments/questions :)

Sorry if they have already been answered somewhere.

autonat/autonat-v2.md

sukunrt · 2023-04-25T07:44:48Z

Thanks for your review, @thomaseizinger.
I'd like your opinion on these two issues:

Proposal: use a list of addresses in priority order for autonat v2 dial requests #539
Proposal: allow AutoNAT to dial all IP addresses, without risking amplification attacks #536

mxinden

Great work @sukunrt. Thank you!

autonat/autonat-v2.md

thomaseizinger · 2023-04-27T16:35:40Z

> thanks for your review @thomaseizinger. I'd like your opinion on these two issues
>
> Proposal: use a list of addresses in priority order for autonat v2 dial requests #539
> Proposal: allow AutoNAT to dial all IP addresses, without risking amplification attacks #536

I don't have anything to add to those at the moment :)

MarcoPolo

On a brief skim, this looks good! I'm curious if we'll want to relax the "implementations MUST NOT dial any multiaddress unless it is based on the IP address the requesting node is observed as". Would it be useful to do this, and we can mitigate the amplification attack some other way?

It seems like there's a healthy discussion already going on, so I'll step back here and let other folks stay involved. If there's anything I can help with, please don't hesitate to ping.

autonat/autonat-v2.md

sukunrt · 2023-04-27T19:22:51Z

Thanks for your review @MarcoPolo

> It seems like there's a healthy discussion already going on, so I'll step back here and let other folks stay involved. If there's anything I can help with, please don't hesitate to ping.

The suggested strategy is discussed here: #536
Please check if we've made any errors there or overlooked something.

Here's the PR for those changes: #542
You can review it there, or here after I merge those changes.

umgefahren · 2024-01-28T15:54:31Z

While working on the rust-libp2p implementation, we discovered a race condition, which we are currently working around with a 100 ms delay. You can read the final comment by @thomaseizinger here: umgefahren/rust-libp2p#1 (comment)

It happens when the server successfully performs a dial-back and then sends the confirmation of the address back to the client, but the client has not yet progressed far enough to have been notified of that successful dial-back when it receives the confirmation. In that case the client wrongly assumed an address was confirmed where no dial back occurred.

thomaseizinger · 2024-01-29T04:45:26Z

> In that case the client wrongly assumed an address was confirmed where no dial back occurred.

Minor correction here: The behaviour is usually that the client discards the "successful" confirmation because it has not yet processed the dial-back so it thinks the server is sending it a confirmation without having actually done the dial.

I think the correct way to solve this would be to add an ACK message from the client back to the server for the dial-back where the client can say: "Yes I've processed your dial-back". The server can then proceed to respond on the other stream and thus guarantee that we don't have a race condition between the two streams.
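
The ordering proposed above can be sketched as a small simulation. This is only an illustration under stated assumptions: the two streams are modelled as in-memory `asyncio` queues, and every name here (`server`, `client`, `run_exchange`, the `"ACK"` and `"OK"` strings) is hypothetical rather than taken from the spec or from any libp2p implementation.

```python
import asyncio


async def server(dial_back, ack, response, nonce):
    # Simulated dial-back: deliver the nonce on the dial-back stream.
    await dial_back.put(nonce)
    # Wait for the client's explicit ACK before answering on the request stream.
    if await ack.get() == "ACK":
        await response.put("OK")


async def client(dial_back, ack, response, nonce, log):
    received = await dial_back.get()
    # Simulate the client being slow to process the dial-back.
    await asyncio.sleep(0.01)
    dial_back_verified = received == nonce
    log.append("dial-back processed")
    # Only now tell the server that the dial-back has been processed.
    await ack.put("ACK")
    reply = await response.get()
    log.append("response received")
    return reply == "OK" and dial_back_verified


async def run_exchange(nonce=42):
    # Hypothetical stand-ins for the dial-back stream (both directions)
    # and the original request stream.
    dial_back, ack, response = asyncio.Queue(), asyncio.Queue(), asyncio.Queue()
    log = []
    _, confirmed = await asyncio.gather(
        server(dial_back, ack, response, nonce),
        client(dial_back, ack, response, nonce, log),
    )
    return confirmed, log


confirmed, events = asyncio.run(run_exchange())
print(confirmed, events)
```

Because the server blocks on the ACK, the response cannot overtake the dial-back even when the client is slow, which is the guarantee the fixed delay only approximates.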

sukunrt · 2024-01-29T04:55:11Z

You can read the closing of the stream as the ACK. See: https://github.com/libp2p/go-libp2p/blob/sukun/autonat-v2-2/p2p/protocol/autonatv2/server.go#L251-L257

The spec also dictates closing the stream: https://github.com/libp2p/specs/blame/autonat-v2/autonat/autonat-v2.md#L87

Do you think an explicit ACK is better?

thomaseizinger · 2024-01-29T04:58:38Z

> You can read the closing of the stream as the ACK. See: libp2p/go-libp2p@sukun/autonat-v2-2/p2p/protocol/autonatv2/server.go#L251-L257
>
> The spec also dictates closing the stream: autonat-v2/autonat/autonat-v2.md#L87 (blame)
>
> Do you think an explicit ACK is better?

Yeah, I think so. I associate closing a stream with "I have no more data to write". The client never writes data, so why wouldn't it immediately close the stream? Also, reading a stream and waiting for that to fail because it has been closed is also somewhat odd 🤷‍♂️

sukunrt · 2024-01-29T05:17:35Z

> The client never writes data so why wouldn't it immediately close the stream?

That's a fair point. I'll add an ACK.

sukunrt · 2024-02-05T16:33:58Z

Updated the specs with a DialBackResponse

thomaseizinger

Nice, thank you!

Closes: #4524

This is the implementation of the evolved AutoNAT protocol, named AutonatV2 as defined in the [spec](https://github.com/libp2p/specs/blob/03718ef0f2dea4a756a85ba716ee33f97e4a6d6c/autonat/autonat-v2.md). The stabilization PR for the spec can be found under libp2p/specs#538. The work on the Rust implementation can be found in the PR to my fork: umgefahren#1. The implementation has been smoke-tested with the Go implementation (PR: libp2p/go-libp2p#2469).

The new protocol addresses shortcomings of the original AutoNAT protocol:

- Since the server now always dials back over a newly allocated port, this made #4568 necessary; the client can be sure of the reachability state for other peers, even if the connection to the server was made through a hole punch.
- The server can now test addresses different from the observed address (i.e., the connection to the server was made through a `p2p-circuit`). To mitigate against DDoS attacks, the client has to send more data to the server than the dial-back costs.

Pull-Request: #5526.

Stebalien · 2024-08-12T20:19:25Z

autonat/autonat-v2.md

+same key repeatedly. The only benefit of going via the server to do this attack
+is not spending bandwidth required for a handshake. So the prevention mechanism
+only focuses on bandwidth costs. There is a minor benefit of bypassing IP
+blocklists, but that's made unattractive by the fact that servers may ask 5x


I don't think we can simply shrug this off. This is called a reflection attack and has been a huge issue for open DNS resolvers.

Fixing the amplification side does go a long way, but paying a 5x bandwidth cost for a bunch of free IP addresses seems like a pretty reasonable tradeoff from an attacker's standpoint (especially because said attacker isn't paying for the bandwidth, but likely needs to compromise one machine per IP address).

Also note: home NAT users likely don't need this feature. That is:

* They likely only need 1 dialable address.
* They likely don't care which one.
* Their outbound and inbound IPs are likely identical.

Being willing to dial other addresses does matter for, e.g., AWS and other special settings where there are separate ingress IP addresses. But, in that case, maybe the user should just configure their node correctly rather than relying on AutoNAT? AutoNAT specifically exists to enable home users.

It simplifies client implementations, as they don't need to worry about IPv4 peers versus IPv6 peers. The benefit isn't huge, though, since most IPv4 servers won't have IPv6 connectivity, so they cannot check the IPv6 address anyway.

> Fixing the amplification side does go a long way, but paying a 5x bandwidth cost for a bunch of free IP addresses seems like a pretty reasonable tradeoff from an attacker's standpoint (especially because said attacker isn't paying for the bandwidth, but likely needs to compromise one machine per IP address).

Can you elaborate here? Why isn't the attacker paying for the bandwidth?

This feature enables the following attack:

1. Contact a large number of autonat v2 servers.
2. Give them the target's address to connect to.
3. When requested for data, provide the data slowly:
   * The stream timeout is 1 minute
   * In the period, with a 1Gbps connection, you can send 60Gb ~= 6GB
   * Maximum dial data requirement is 100kB
   * So, in theory, you can run this with 60_000 peers in parallel.
   * The servers have a random wait of up to 3 seconds precisely for this scenario. So in theory we can have 20k connections a second for 3 seconds to the target.

We can make a bunch of implementation improvements to reduce the harm here. The simplest ones being: Only wait 10 seconds for the dial data, and wait for 5 seconds before dialing. That would reduce the max new connection rate to 2k / second, which is very manageable.
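
The figures in this comment can be reproduced with a short calculation. This is a back-of-the-envelope sketch, not a model of any implementation: the function names are hypothetical, and the default of 10 bits per byte is an assumption chosen to match the comment's rounding of 60 Gb to roughly 6 GB.

```python
def max_parallel_peers(link_bits_per_s, window_s, dial_data_bytes, bits_per_byte=10):
    """Servers an attacker can keep waiting while trickling dial data in one window."""
    bytes_in_window = link_bits_per_s * window_s // bits_per_byte
    return bytes_in_window // dial_data_bytes


def peak_new_connections_per_s(peers, dial_delay_spread_s):
    """Dial-backs per second if the servers spread their dials over the given delay."""
    return peers // dial_delay_spread_s


GBPS = 1_000_000_000
DIAL_DATA = 100_000  # 100 kB maximum dial data, as stated in the comment

# Figures from the comment: 1 minute stream timeout, up to 3 seconds of random wait.
current_peers = max_parallel_peers(GBPS, 60, DIAL_DATA)
current_rate = peak_new_connections_per_s(current_peers, 3)

# Proposed mitigation: 10 seconds for the dial data, 5 seconds before dialing.
mitigated_peers = max_parallel_peers(GBPS, 10, DIAL_DATA)
mitigated_rate = peak_new_connections_per_s(mitigated_peers, 5)

print(current_peers, current_rate, mitigated_peers, mitigated_rate)
```

With exactly 8 bits per byte the first figure becomes 75,000 peers instead of 60,000, so the conclusion does not depend on the rounding.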

The ideal solution is to introduce rate limits for new connections.
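
One common way to rate limit new connections is a token bucket. The comment does not say where such a limit would sit or how it would be configured, so the sketch below is purely illustrative: the `TokenBucket` class, its parameters, and the choice of a token bucket at all are assumptions. Time is passed in explicitly to keep the example deterministic.

```python
class TokenBucket:
    """Allow at most `rate_per_s` new connections per second, with a small burst."""

    def __init__(self, rate_per_s, burst):
        self.rate = rate_per_s
        self.capacity = burst
        self.tokens = float(burst)
        self.last = 0.0

    def allow(self, now):
        # Refill in proportion to the elapsed time, capped at the burst size.
        elapsed = now - self.last
        self.last = now
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


bucket = TokenBucket(rate_per_s=2, burst=3)
# Five connection attempts arrive at the same instant: only the burst is admitted.
decisions = [bucket.allow(0.0) for _ in range(5)]
# One second later the bucket has refilled enough to admit another connection.
later = bucket.allow(1.0)
print(decisions, later)
```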

There's another problem with this feature related to implementation difficulty:

The primary use for this feature is to allow both IPv4 and IPv6 addresses to be tested without worrying about whether we have a v4 or a v6 connection. So you can ask a v4 peer to test your v6 address. This requires correctly reporting an error in case the server has no v6 connectivity, which is the case for the majority of servers.

I'm not sure if the rust implementation correctly handles this case. @umgefahren please correct me if I'm wrong here.
* See discussion around this comment: feat(autonatv2): Implement autonat v2 umgefahren/rust-libp2p#1 (comment)

I'm also not sure if we can rely on other implementations to correctly handle this. They might just make a dial, fail, and report unreachable.

If we have to ensure that we check v6 addresses with a v6 peer, it might be better to just disable this feature.
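
The behaviour being asked for can be sketched as follows: a server walks the priority-ordered list, dials the first address whose IP family it can reach, and refuses outright when it can reach none, rather than attempting a dial and reporting the address as unreachable. The names `Outcome` and `pick_address` are hypothetical, and plain IP strings stand in for multiaddresses.

```python
import ipaddress
from enum import Enum


class Outcome(Enum):
    DIAL = "dial"        # server will attempt a dial-back to the chosen address
    REFUSED = "refused"  # server cannot test any requested address


def pick_address(requested_ips, server_has_ipv4, server_has_ipv6):
    """Return (Outcome, index) for the first address the server is able to dial."""
    for index, text in enumerate(requested_ips):
        version = ipaddress.ip_address(text).version
        if (version == 4 and server_has_ipv4) or (version == 6 and server_has_ipv6):
            return Outcome.DIAL, index
    # No dial is attempted, so nothing can be misreported as unreachable.
    return Outcome.REFUSED, None


# A v4-only server skips the v6 address and tests the v4 one instead.
mixed = pick_address(["2001:db8::1", "203.0.113.7"], True, False)
# With only a v6 address requested, the same server refuses instead of failing a dial.
v6_only = pick_address(["2001:db8::1"], True, False)
print(mixed, v6_only)
```

Keeping refusal distinct from a failed dial is what lets a client treat "this server could not check" differently from "this address is unreachable".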

@MarcoPolo @Stebalien @umgefahren thoughts?

The potential attack vector seems to be described correctly; however, I'm not sure that rust-libp2p is affected. The whole process is allowed to take a maximum of 10 seconds:
https://github.com/libp2p/rust-libp2p/blob/8ceadaac5aec4b462463ef4082d6af577a3158b1/protocols/autonat/src/v2/server/handler/dial_request.rs#L66
However, we don't wait any time before dialing back. That mitigation is a quick fix that I can prepare.

Regarding IPv4 and IPv6 I stand with @thomaseizinger's comment on that matter: umgefahren/rust-libp2p#1 (comment)

So it correctly handles that case, in that we don't generate false positives.

p-shahi · 2024-09-10T16:34:59Z

@sukunrt, given that AutoNAT v2 is merged in two reference implementations, libp2p/rust-libp2p#5526 (released in 0.13.0) and libp2p/go-libp2p#2469 (released in 0.36.1), are there any outstanding comments that need to be addressed before this pull request can be merged? If any are non-blocking, can they be addressed in follow-up PRs?
Also, the maturity should be either a Recommendation (I believe there is demonstrated interop between Go and Rust impls?)

umgefahren · 2024-09-10T17:01:03Z

> Also, the maturity should be either a Recommendation (I believe there is demonstrated interop between Go and Rust impls?)

@sukunrt and I did interop testing and successfully verified that they are working together.

sukunrt · 2024-09-11T18:27:40Z

The implementation is not used in go-libp2p yet. We should merge this after we start inferring reachability in go-libp2p.

MarcoPolo · 2024-09-30T18:41:04Z

autonat/autonat-v2.md

+
+This `DialRequest` message has a list of addresses and a fixed64 `nonce`. The
+list is ordered in descending order of priority for verification. AutoNAT V2 is
+only for testing reachability on Public Internet. Client SHOULD NOT send any


I think this should be "MUST NOT", along with "The server MUST NOT dial any private address".

It is possible to implement these safely though. Both the client and the server need to check that the peer is connected over a private IP.

client 192.168.0.100 -> server 192.168.0.10
In this case it's reasonable for the client to ask the server to test its private IP reachability.

This is an edge case I'm willing to ignore though. Happy to change this to MUST, just that keeping it SHOULD allows some implementation to provide this feature if they're willing to.

Can you know that you are indeed on the same private network?

I'm not completely sure.

If you see a local connection address in a private IP range and a remote connection address in a private IP range, is that enough to conclude that you're in some private network?
Note you cannot rely on https://datatracker.ietf.org/doc/html/rfc1918 subnet masks as you can make a private network from a collection of smaller private networks.

I think if the local address and remote address are in the same private subnet, then it would be okay.
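
A minimal sketch of that check, assuming the node knows the prefix length of its own local network (the `prefix_len` parameter and the `same_private_subnet` name are hypothetical; as noted above, the prefix cannot be inferred from the RFC 1918 ranges themselves):

```python
import ipaddress


def same_private_subnet(local_ip, remote_ip, prefix_len=24):
    """True if both addresses are private and share the local network prefix."""
    local = ipaddress.ip_address(local_ip)
    remote = ipaddress.ip_address(remote_ip)
    if local.version != remote.version:
        return False
    if not (local.is_private and remote.is_private):
        return False
    # strict=False lets us build the network from a host address.
    network = ipaddress.ip_network(f"{local_ip}/{prefix_len}", strict=False)
    return remote in network


# The example from this thread: client 192.168.0.100 -> server 192.168.0.10.
same = same_private_subnet("192.168.0.100", "192.168.0.10")
# Both private, but in different /24 networks.
different = same_private_subnet("192.168.0.100", "192.168.1.10")
# A public remote address never qualifies.
public = same_private_subnet("192.168.0.100", "8.8.8.8")
print(same, different, public)
```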

How about adding "The server SHOULD NOT dial any private address"? This leaves the door open in the spec.

I'm not sure about the usefulness of doing this, but others might have a use for it.

sukunrt force-pushed the autonat-v2 branch from b95fb58 to 9e086f8 Compare April 11, 2023 07:46

sukunrt requested a review from marten-seemann April 11, 2023 07:51

sukunrt marked this pull request as ready for review April 11, 2023 12:05

sukunrt requested review from mxinden and MarcoPolo April 11, 2023 12:05

marten-seemann reviewed Apr 11, 2023

sukunrt marked this pull request as draft April 11, 2023 16:51

sukunrt force-pushed the autonat-v2 branch from 9e086f8 to f973931 Compare April 12, 2023 08:28

sukunrt changed the base branch from master to autonat-rename April 12, 2023 08:29

sukunrt force-pushed the autonat-v2 branch from f973931 to b3bd5e0 Compare April 12, 2023 09:04

add autonat v2 spec

d663611

sukunrt force-pushed the autonat-v2 branch from b3bd5e0 to d663611 Compare April 12, 2023 09:04

sukunrt mentioned this pull request Apr 12, 2023

Proposal: use a list of addresses in priority order for autonat v2 dial requests #539

Open

sukunrt marked this pull request as ready for review April 12, 2023 10:11

sukunrt commented Apr 15, 2023

autonat/autonat-v2.md (outdated, resolved)

use priority ordered list in requests for autonat-v2

1db8613

sukunrt mentioned this pull request Apr 19, 2023

use a priority ordered list of addresses in autonat v2 #541

Merged

sukunrt added 2 commits April 21, 2023 20:08

only send index of the dialed address

0ff8ac6

accept a priority ordered list of addresses for dial requests

f2a431c

thomaseizinger reviewed Apr 23, 2023

Improve naming for messages

62123df

add interaction diagram

0771bab

mxinden reviewed Apr 26, 2023

autonat/autonat-v2.md (resolved)

autonat/autonat-v2.md (outdated, resolved)

autonat/autonat-v2.md (outdated, resolved)

address review comments

3e57202

sukunrt force-pushed the autonat-v2 branch from 2c82bd1 to 3e57202 Compare April 27, 2023 11:33

MarcoPolo reviewed Apr 27, 2023

autonat/autonat-v2.md (outdated, resolved)

autonat/autonat-v2.md (outdated, resolved)

mxinden mentioned this pull request Nov 16, 2023

Autonat doesn't support multiple addresses well libp2p/rust-libp2p#4873

Open

add a response to the dialback stream

1c76613

sukunrt requested a review from thomaseizinger February 5, 2024 16:29

thomaseizinger approved these changes Feb 14, 2024

MarcoPolo mentioned this pull request Jun 19, 2024

AutoNAT: Network ReachabilityPublic distinguishes between IPv6 and IPv4 #614

Closed

MarcoPolo linked an issue Jun 19, 2024 that may be closed by this pull request

AutoNAT: Network ReachabilityPublic distinguishes between IPv6 and IPv4 #614

Closed

allow the client to send slightly more dial data

03718ef

umgefahren mentioned this pull request Aug 4, 2024

feat(autonat): Implement AutoNATv2 libp2p/rust-libp2p#5526

Merged

5 tasks

lidel mentioned this pull request Aug 5, 2024

feat: run AutoNAT V2 service in addition to V1 ipfs/kubo#10468

Merged

3 tasks

Stebalien reviewed Aug 12, 2024

MarcoPolo reviewed Sep 30, 2024

add note that server should not dial any private address

0195203

sukunrt requested a review from MarcoPolo October 31, 2024 04:32

sukunrt merged commit acd5c31 into autonat-rename Oct 31, 2024

sukunrt deleted the autonat-v2 branch October 31, 2024 04:34

sukunrt added a commit that referenced this pull request Oct 31, 2024

add autonat v2 spec (#538)

4ce6402

sukunrt mentioned this pull request Oct 31, 2024

add autonat v2 spec #638

Open

sukunrt added a commit that referenced this pull request Nov 1, 2024

add autonat v2 spec (#538)

55ae737

sukunrt added a commit that referenced this pull request Nov 1, 2024

add autonat v2 spec (#538)

d9f46e6

sukunrt commented Apr 11, 2023

marten-seemann left a comment

sukunrt commented Apr 11, 2023

thomaseizinger left a comment

sukunrt commented Apr 25, 2023

mxinden left a comment

thomaseizinger commented Apr 27, 2023

MarcoPolo left a comment

sukunrt commented Apr 27, 2023

umgefahren commented Jan 28, 2024

thomaseizinger commented Jan 29, 2024

sukunrt commented Jan 29, 2024

thomaseizinger commented Jan 29, 2024

sukunrt commented Jan 29, 2024

sukunrt commented Feb 5, 2024

thomaseizinger left a comment

Stebalien Aug 12, 2024

Stebalien Aug 12, 2024

sukunrt Aug 13, 2024

sukunrt Sep 30, 2024

umgefahren Sep 30, 2024

p-shahi commented Sep 10, 2024

umgefahren commented Sep 10, 2024

sukunrt commented Sep 11, 2024

MarcoPolo Sep 30, 2024

sukunrt Oct 29, 2024

MarcoPolo Oct 29, 2024

sukunrt Oct 29, 2024

MarcoPolo Oct 31, 2024

sukunrt Oct 31, 2024
