Make L1 client async again #2240

Merged: 6 commits into main on Nov 4, 2024

Conversation

@jbearer (Member) commented Oct 31, 2024

Make the L1 client update asynchronously in a background task. This allows us to subscribe to updates from the L1 rather than polling every time we have a new HotShot block, which vastly reduces the number of RPC calls we make. This is in response to an increase in the number of L1 RPC calls being made due to the new logic that validates the L1 head and finalized block in each proposal.

This PR:

  • Rewrites the L1 client to work based on an in-memory state that is updated by an async task (a simplified sketch of this pattern follows below)
  • Implements the various wait_for_* functions using an event stream populated by the async task
  • Supports either HTTP or WebSockets as a backend for the L1 client
  • Adds an LRU cache for L1 blocks
  • Adds more configurability for the L1 client

Key places to review:

  • types/src/v0/impls/l1.rs
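
For orientation, here is a minimal sketch of the pattern this PR adopts (illustrative only; the real implementation is in types/src/v0/impls/l1.rs and its names, types, and error handling differ): a background task keeps shared in-memory state current and broadcasts an event for every update, and the wait_for_* helpers subscribe to that event stream instead of issuing their own RPC calls.

```rust
// Minimal sketch of the "background update task + event stream" design,
// built on tokio primitives; not the actual espresso-sequencer code.
use std::sync::Arc;
use tokio::sync::{broadcast, RwLock};

#[derive(Clone, Debug)]
enum L1Event {
    NewHead { head: u64 },
}

#[derive(Default)]
struct L1State {
    head: u64,
}

#[derive(Clone)]
struct L1Client {
    state: Arc<RwLock<L1State>>,
    sender: broadcast::Sender<L1Event>,
}

impl L1Client {
    fn new() -> Self {
        let (sender, _) = broadcast::channel(64);
        Self {
            state: Arc::new(RwLock::new(L1State::default())),
            sender,
        }
    }

    /// Spawn the background task that keeps `state` current and publishes
    /// an event for every new head it sees. In the real client the source
    /// would be an L1 RPC subscription (WebSockets) or a polling loop
    /// (HTTP), not a broadcast receiver.
    fn spawn_update_task(&self, mut heads: broadcast::Receiver<u64>) {
        let state = self.state.clone();
        let sender = self.sender.clone();
        tokio::spawn(async move {
            while let Ok(head) = heads.recv().await {
                state.write().await.head = head;
                // A send error only means nobody is currently waiting.
                let _ = sender.send(L1Event::NewHead { head });
            }
        });
    }

    /// Wait until the L1 head reaches `number` without making any RPC
    /// calls: subscribe first, then check the snapshot, then listen.
    async fn wait_for_block(&self, number: u64) {
        let mut events = self.sender.subscribe();
        if self.state.read().await.head >= number {
            return;
        }
        while let Ok(L1Event::NewHead { head }) = events.recv().await {
            if head >= number {
                return;
            }
        }
    }
}
```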

This allows us to subscribe to updates from the L1, rather than
polling every time we have a new HotShot block. This vastly reduces
the number of RPC calls we make.

Also adds an LRU cache for L1 blocks.
@jbearer force-pushed the jb/async-l1-client branch from c58b6b2 to b712b84 on October 31, 2024 15:48
let state = self.state.clone();
let sender = self.sender.clone();

let span = tracing::warn_span!("L1 client update");

Collaborator:

What does this do? Does it apply to anything at the WARN level or more serious, or only at the WARN level?

jbearer (Member, Author):

It applies to anything, but the span will only show up on WARN or higher. Now that I think about it, since this span is just a tag with no extra data, it's quite cheap to log this all the time, so this could easily be info_span!

jbearer (Member, Author):

Er, wait, I had it backwards. This appears any time the log level is set to WARN or lower, just like a normal tracing::warn!. So we actually want to keep it this way: we want to see this info even if the node is running with RUST_LOG=warn.
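
To illustrate the point, here is a standalone sketch (not project code) of how span levels interact with the subscriber's max level: a warn_span is enabled whenever the filter admits WARN, so events inside it keep the span tag even under RUST_LOG=warn, while an info_span is disabled at that level.

```rust
// Requires the `tracing` and `tracing-subscriber` crates.
use tracing::{info_span, warn_span, Level};

fn main() {
    // Roughly equivalent to running with RUST_LOG=warn.
    tracing_subscriber::fmt().with_max_level(Level::WARN).init();

    // Enabled: a WARN-level span passes a WARN max level.
    let span = warn_span!("L1 client update");
    span.in_scope(|| {
        // Emitted, and tagged with the "L1 client update" span.
        tracing::warn!("stuck waiting for L1 block");
    });

    // Disabled under a WARN max level: the event below is still emitted
    // (it is WARN level itself) but without the span tag.
    let span = info_span!("L1 client update");
    span.in_scope(|| tracing::warn!("stuck waiting for L1 block"));
}
```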

continue;
};
if head >= number {

Collaborator:

I think if we log that we are waiting for the block, it's also nice to log that we found it.

let L1Event::NewFinalized { finalized } = event else {
continue;
};
if finalized.number < number {

Collaborator:

Nit: for the latest block we do the comparison the other way around above (if head >= number {). I think we can do it the same way both times; then we also don't need this continue, I think.
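
A rough sketch of what the suggested shape might look like (purely illustrative; the event and block types below are stand-ins for the ones in the snippets above), combining the >= comparison with the "found it" log suggested earlier:

```rust
use tokio::sync::broadcast;

// Stand-ins so the sketch is self-contained; not the project's types.
#[derive(Clone, Debug)]
struct L1BlockInfo {
    number: u64,
}

#[derive(Clone, Debug)]
enum L1Event {
    NewHead { head: u64 },
    NewFinalized { finalized: L1BlockInfo },
}

/// Wait for a finalized block at height >= `number`, using the same
/// comparison direction as the latest-block wait (head >= number) and
/// logging when the block is found. (Lagged receivers are ignored here
/// for brevity.)
async fn wait_for_finalized(
    mut events: broadcast::Receiver<L1Event>,
    number: u64,
) -> Option<L1BlockInfo> {
    while let Ok(event) = events.recv().await {
        let L1Event::NewFinalized { finalized } = event else {
            continue;
        };
        if finalized.number >= number {
            tracing::info!(number, "got finalized L1 block");
            return Some(finalized);
        }
    }
    // The event stream closed before the block was finalized.
    None
}
```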

tracing::warn!("sleeping for {dur:?}, until {timestamp}");
sleep(Duration::from_secs(dur)).await;
}
// Wait until the finalized block has timestamp >= `timestamp`.

Collaborator:

Wondering if we actually need this function. It's only used on startup if the genesis block is not finalized.

.wait_for_finalized_block_with_timestamp(timestamp.unix_timestamp().into())

I'm wondering if it would be okay to just panic in that case.

jbearer (Member, Author):

No, we can't just panic there. The intended use case is that you set the genesis block to something in the future; then you can start up all the nodes ahead of time, and they will just wait until the genesis block is ready and then start running.

assert!(
state.snapshot.finalized.is_some()
&& number <= state.snapshot.finalized.unwrap().number,
"requesting a finalized block {number} that isn't finalized; snapshot: {:?}",

Collaborator:

Wonder why we don't make this a fallible function instead of one that might panic.

jbearer (Member, Author):

The idea is that this can't panic based solely on the internal behavior of this module; i.e., we only ever call this private helper function with a finalized block number. There is no input from callers in other modules that can cause this function to panic, so we shouldn't bubble the error up to the caller.

And because the eventual interface is infallible (e.g. validate_apply_header calls wait_for_finalized_block and blocks until we either get it or time out), any errors we return would have to be handled with retry loops, which IMO makes the calling code more complicated for little reason when it's an error that should never happen to begin with.
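
A sketch of the design being described (the shapes below are illustrative, not the actual module): the lookup is a private helper whose precondition is enforced with an assert, because only code inside the module can violate it.

```rust
use std::collections::HashMap;

// Stand-ins so the sketch is self-contained; not the project's types.
#[derive(Clone, Copy, Debug)]
struct L1BlockInfo {
    number: u64,
}

#[derive(Default, Debug)]
struct L1Snapshot {
    finalized: Option<L1BlockInfo>,
}

#[derive(Default)]
struct L1State {
    snapshot: L1Snapshot,
    finalized: HashMap<u64, L1BlockInfo>, // an LRU cache in the real code
}

impl L1State {
    /// Private helper: callers inside this module only ever pass a block
    /// number that is already known to be finalized, so a violation is a
    /// bug in this module, not a recoverable error to surface to callers.
    fn cached_finalized_block(&self, number: u64) -> Option<L1BlockInfo> {
        assert!(
            self.snapshot.finalized.is_some()
                && number <= self.snapshot.finalized.unwrap().number,
            "requesting a finalized block {number} that isn't finalized; snapshot: {:?}",
            self.snapshot,
        );
        self.finalized.get(&number).copied()
    }
}
```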

pub(crate) struct L1State {
pub(crate) snapshot: L1Snapshot,
pub(crate) finalized: LruCache<u64, L1BlockInfo>,
pub(crate) finalized_by_timestamp: BTreeMap<U256, u64>,

Collaborator:

Similar to my last comment: IIUC, since this is only used on startup and is in-memory only, I think it might be simpler if we didn't have it at all, or at least didn't keep a cache for it.

jbearer (Member, Author):

Good call: removing this cache makes things considerably simpler at very little cost.
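
For reference, a sketch of roughly what the state looks like after dropping the timestamp index (field names follow the snippet above; the surrounding types are stand-ins, and LruCache is from the lru crate):

```rust
use lru::LruCache;

// Stand-ins so the sketch is self-contained; not the project's
// actual definitions.
pub(crate) struct L1BlockInfo {
    pub number: u64,
    pub timestamp: u64,
}

pub(crate) struct L1Snapshot {
    pub finalized: Option<L1BlockInfo>,
}

pub(crate) struct L1State {
    pub(crate) snapshot: L1Snapshot,
    pub(crate) finalized: LruCache<u64, L1BlockInfo>,
    // `finalized_by_timestamp: BTreeMap<U256, u64>` is gone: it was only
    // used at startup, and the backwards walk that finds the earliest
    // block with a given timestamp already goes through the LRU cache.
}
```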

@sveitser self-requested a review on November 4, 2024 15:34

@sveitser (Collaborator) left a comment:

Overall LGTM, just non-urgent suggestions.

jbearer and others added 3 commits on November 4, 2024 09:06
This cache is used only at startup, and it barely even saves us
any RPC calls, since the loop where we work backwards looking for
the earliest block with a given timestamp already uses the other
cache. Removing this simplifies things considerably.
@jbearer merged commit 5bc096e into main on Nov 4, 2024
18 checks passed
@jbearer deleted the jb/async-l1-client branch on November 4, 2024 18:22