slot-based-collator: Allow multiple blocks per slot #7569

skunert · 2025-02-13T17:54:06Z

Summary: This PR enables authoring of multiple blocks in one AURA slot in the slot-based collator and stabilizes the slot-based collator.

CLI Changes

The flag --experimental-use-slot-based is now marked as deprecated. I opted to introduce --authoring slot-based instead of just removing the experimental prefix. By introducing the authoring variant, we get some future-proofing in case we want to introduce further options.

Change Description

With elastic-scaling, we are able to author multiple blocks with a single relay-chain parent. In the initial iteration, the interval between two blocks was determined by the slot_duration of the parachain. This PR introduces a more flexible model, where we try to author multiple blocks in a single slot if the runtime allows it.

The block authoring loop is largely the same. The SlotTimer now lives in a separate module and is updated with the last seen core count. It will then trigger rounds in the block-building loop based on the core count.

This allows some flexibility where elastic-scaling chains can run on a single core in quiet times. Previously, running on 1 core with a 3-core elastic-scaling chain would result in authors getting skipped because the slot_duration was too low.

Parameter Considerations

The core logic does not change, so there are a few things to consider:

The ConsensusHook implementation still determines how many blocks are allowed per relay-chain block. So if you add arbitrary cores to an async-backing, 6-second parachain, can_build_upon in the runtime will deny block-building of additional blocks.
The MINIMUM_PERIOD in the runtime needs to be configured to allow enough blocks in the slot. A "classic" configuration of SLOT_DURATION/2 will lead to slot mismatches when running with 3 cores.
We fetch available cores at least once every relay chain block. So if a parachain runs with a 12-second slot duration and 1 fixed core, we would still author 2 blocks if the parachain runtime allows it.

skunert · 2025-02-14T14:45:08Z

/cmd prdoc --audience node_operator --bump major

…-bump major'

skunert · 2025-02-14T15:07:10Z

cumulus/zombienet/zombienet-sdk/src/elastic_scaling_multiple_blocks_per_slot.rs

This test is heavily inspired by the tests introduced for polkadot. However, I wanted to go with a version that is simpler to run by using the dynamic subxt feature. It is a bit more prone to breaking, but these tests should clearly fail if something changes, and it gets rid of the build.rs, env variables and zombie-metadata feature.

@skunert does make sense to apply the same changes to the helper in polkadot dir?
Thx!

IMO yes, we could unify all of this and get rid of the overhead. But it is a bit opinionated, and typically we had each team maintain their tests, so did not want to make that decision alone. If @alindima also finds it useful, we can unify in a follow-up (would like to keep the scope small here).

michalkucharczyk

1st round, will get back to it.

cumulus/zombienet/zombienet-sdk/Cargo.toml

cumulus/zombienet/zombienet-sdk/README.md

michalkucharczyk · 2025-02-14T13:54:39Z

cumulus/test/runtime/src/lib.rs

+
+#[cfg(feature = "elastic-scaling-multi-block-slot")]
+parameter_types! {
+	pub const MinimumPeriod: u64 = SLOT_DURATION / 6;


dq: why 6? This gives us support for max 6 cores?

This means that the time between blocks will be at least MINIMUM_PERIOD. If the inherent gives a smaller time than PREVIOUS_TIME + MINIMUM_PERIOD, then the time is set to PREVIOUS_TIME + MINIMUM_PERIOD. In order to produce that many blocks in a single slot, you need to make sure that the minimum period does not push the timestamp into the next slot, otherwise you will be greeted with a slot mismatch in the runtime.

cumulus/test/runtime/Cargo.toml

michalkucharczyk · 2025-02-14T14:02:41Z

cumulus/client/consensus/aura/src/collators/slot_based/block_builder_task.rs

@@ -91,6 +90,7 @@ pub struct BuilderTaskParams<
 	pub authoring_duration: Duration,
 	/// Channel to send built blocks to the collation task.
 	pub collator_sender: sc_utils::mpsc::TracingUnboundedSender<CollatorMessage<Block>>,
+	pub relay_chain_slot_duration: Duration,


doc is coming soon, right?