fork-aware-tx-pool: add heavy load tests based on zombienet #7257

iulianbarbu · 2025-01-20T11:23:52Z

Description

Builds towards addressing #5497 by adding zombienet-sdk based infrastructure that can be used to spin up regular networks, as described in the fork-aware transaction pool testing setup added in #7100. It will be used for developing tests against such networks, and also for spawning them locally on demand through tooling that will be developed in follow-ups.

Integration

Node/runtime developers can run tests based on the zombienet-sdk infra, which spins up frequently used networks for analyzing the behavior of various node-related components, like the fork-aware transaction pool.

Review Notes

Uses ttxt API implemented here: https://github.com/michalkucharczyk/tx-test-tool/pull/22/files
Currently only one test scenario is considered: 10k future and 10k ready txs are sent in parallel to a fatp-based collator, and at the end we assert that both 10k batches of txs have been finalized.

Signed-off-by: Iulian Barbu <[email protected]>


michalkucharczyk

So far looks good.

I was thinking about CLI. Maybe we don't need it, after all? Instead, we could use following:

$ cargo test --test stand_alone -- --exact run_single_collator_network

which would just:

spawn the network,
print out the location of the executed binaries (this one is important to me, to be absolutely sure that I don't run test on old binaries),
print out the location of logs file,
... or maybe just print zombienet summary - with ports, params, etc...
wait forever

Then one could use any tooling to just send transactions to this network.
We could start with this and see how it goes, doing the next iteration from here rather than over-complicating it from the beginning. That way we would use the same config for manual testing and for predefined test suites.
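The "print out the location of the executed binaries" step from the list above could be approximated without any zombienet-sdk support. The following is only a minimal sketch using the standard library: `resolve_binary` and `print_binary_locations` are hypothetical helper names, and the lookup only checks that a file exists in a `PATH` directory (it does not check the executable bit), so it is not necessarily the exact binary zombienet would run.

```rust
use std::env;
use std::ffi::OsStr;
use std::path::PathBuf;

/// Hypothetical helper: returns the first `<dir>/<bin>` that exists as a file,
/// walking the directories of `path_var` in order. `path_var` uses the same
/// format as the `PATH` environment variable.
fn resolve_binary(bin: &str, path_var: &OsStr) -> Option<PathBuf> {
    env::split_paths(path_var)
        .map(|dir| dir.join(bin))
        .find(|candidate| candidate.is_file())
}

/// Hypothetical helper: prints where each binary name resolves to on the
/// current `PATH`, so stale binaries are easy to spot before a test run.
fn print_binary_locations(bins: &[&str]) {
    let path_var = env::var_os("PATH").unwrap_or_default();
    for bin in bins {
        match resolve_binary(bin, &path_var) {
            Some(path) => println!("{bin} -> {}", path.display()),
            None => println!("{bin} -> not found on PATH"),
        }
    }
}
```

A stand-alone test could call `print_binary_locations(&["polkadot", "polkadot-parachain"])` right after spawning the network; the binary names here are only examples.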

One more idea for controlling parameters would be to use environment variables (not needed in the first step). This provides flexibility; it is less convenient than CLI args, but much easier to implement.

export TXPOOLTESTS_POOL_LIMIT=1000
$ cargo test --test stand_alone -- --exact run_single_collator_network
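On the test side, reading such a variable could look roughly like the sketch below. Only the variable name `TXPOOLTESTS_POOL_LIMIT` comes from the comment above; the default value and the helper names `pool_limit_from` and `pool_limit` are assumptions made for illustration.

```rust
use std::env;

/// Assumed default, used when the variable is unset or not a valid number.
const DEFAULT_POOL_LIMIT: usize = 10_000;

/// Parses the raw value of the variable, falling back to the default.
/// Kept separate from the environment lookup so it is easy to unit test.
fn pool_limit_from(raw: Option<&str>) -> usize {
    raw.and_then(|value| value.trim().parse().ok())
        .unwrap_or(DEFAULT_POOL_LIMIT)
}

/// Reads `TXPOOLTESTS_POOL_LIMIT` from the environment of the test process.
fn pool_limit() -> usize {
    pool_limit_from(env::var("TXPOOLTESTS_POOL_LIMIT").ok().as_deref())
}
```

Falling back silently keeps a plain `cargo test` invocation working; a stricter variant could panic on an unparsable value instead.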

Still we can have all the integration tests in different test module, reusing the same network configurations as those spawned in stand_alone mod, for example:

$ cargo test --test integration -- --exact single_collator_network__single_account_1M_txs

stand_alone tests would be excluded from the cargo test command (as they never terminate on their own).

Any thoughts on this?


pepoviola · 2025-01-21T09:30:51Z

substrate/client/transaction-pool/tests/zombienet/limits_30.rs

+
+#[async_trait::async_trait]
+impl Network for Limits30Network {
+	fn ensure_bins_on_path(&self) -> bool {


Hi, sorry to chime in. This is already checked by zombienet-sdk internally (for each cmd to execute and the workers).

Is zombienet-sdk capable of printing full executable paths? (I know, I am a bit paranoid on this 😅)

@pepoviola can you point me to where we're doing these checks in zombienet-sdk?

So far the logs zombienet emits aren't showing the full binary paths for the node binaries it executes. @michalkucharczyk does it help if the logs contain the dump of the $PATH variable? Thinking we can make a feature request for zn-sdk logs to show that when configuring the network.

At the same time, @pepoviola, is there a way to pass a log file path to zn-sdk, so that the log records emitted by zn-sdk could be tailed and easily analyzed later on, instead of being printed directly to stdout (which, depending on the terminal settings, can be limited and polluted with other output)?

pepoviola · 2025-01-21T09:35:30Z

Hi @iulianbarbu / @michalkucharczyk, I'm working on a small CLI to spawn from toml, and we can already load tomls. Do you think that could be handy here?
Thanks!

michalkucharczyk · 2025-01-21T09:53:17Z

Hi @iulianbarbu / @michalkucharczyk, I'm working on a small CLI to spawn from toml, and we can already load tomls. Do you think that could be handy here? Thanks!

My 3 cents:
Our goal here is to have an abstraction that lets us run a test suite against a predefined network, and also run exactly the same predefined network to conduct manual tests.

We actually want to spawn the network programmatically, so I am not sure that a CLI will be helpful here. But having some API in zombienet that would accept a toml and spawn the network could potentially be helpful, especially when it comes to customization: instead of playing with CLI args or environment variables, as I proposed in my previous comment, we could just edit the toml file.

On the other hand, it seems that using zn-sdk is not that difficult.

@iulianbarbu what is your opinion?

iulianbarbu · 2025-01-22T11:16:16Z

Responding to the last messages where @pepoviola chimed in:

having some API in zombienet that would accept a toml and spawn the network could potentially be helpful, especially when it comes to customization: instead of playing with CLI args or environment variables, as I proposed in my previous comment, we could just edit the toml file.

+1 to this idea @michalkucharczyk . I personally prefer Rust and zn-sdk for the testsuite, while for manual runs, if we'd be able to import the tomls directly with zn-sdk, and have the option to also use them with a CLI, then we can have the best of both worlds. It would be just a preference for how we'd like to do the manual testing, because we can still run the testsuite locally, by changing things within the rust tests, but if we want just to run the network and then do other stuff against it, we'd have the CLI as well.

we already can load tomls.

yup, thanks @pepoviola for confirming this offline. For reference: https://docs.rs/zombienet-sdk/latest/zombienet_sdk/struct.NetworkConfig.html#method.load_from_toml.

I'm working on a small CLI to spawn from toml

@pepoviola how different would it be from the existing zombienet CLI, and why do we need another one?


alindima · 2025-01-23T13:29:46Z

We don't want to commit the chainspecs to the repo, right? The better way is to generate them using build scripts, like we do here for example: https://github.com/paritytech/polkadot-sdk/blob/master/polkadot/zombienet-sdk-tests/build.rs

michalkucharczyk · 2025-01-23T14:25:48Z

We don't want to commit the chainspecs to the repo, right? The better way is to generate them using build scripts, like we do here for example: https://github.com/paritytech/polkadot-sdk/blob/master/polkadot/zombienet-sdk-tests/build.rs

Good point. Chain-spec can be avoided with this commit:
#6267

It should be enough to define dev-accounts in the genesis-patch (which in turn can be given in the zombienet toml file).

However, not sure if we need extra aka-build.rs step. My guess would be it is not needed.

iulianbarbu · 2025-01-23T15:38:40Z

I think we need to mention the path to the runtime as well if mentioning the patch. That's a variable path, but we can assume it is target/release/wbuild/..., which should be fine for 99% of the cases (?). When loading the network with zn-sdk from zombienet.toml we must mention the runtime path in the toml file explicitly since there is no API to achieve changes after obtaining the network config.

However, not sure if we need extra aka-build.rs step. My guess would be it is not needed.

Can't see either how a build.rs can help when using zombienet-sdk with tomls loading.


michalkucharczyk · 2025-01-24T11:56:39Z

I think we need to mention the path to the runtime as well if mentioning the patch.

Maybe we don't need runtime path?
For runtimes that are already embedded in the polkadot-parachain binary, it should be enough to just give the name of the runtime. This is already done in many tomls across the codebase.

We could skip yap which is kinda experimental. (or add it in 2nd phase / followup).

iulianbarbu · 2025-01-24T11:59:41Z

Oh yeah, I guess asset-hub-* is covered. Leaving out experimental runtimes sounds good.


michalkucharczyk · 2025-02-18T14:14:59Z

substrate/client/transaction-pool/tests/integration.rs

+	let future_scenario_executor = ScenarioBuilder::new()
+		.with_rpc_uri(ws.to_string())
+		.with_chain_type(ChainType::Sub)
+		.with_block_monitoring(block_monitor)


I suspect we will duplicate this across many tests, so this could be common function:

fn zn_scenario_builder(ws: &str) -> ScenarioBuilder {
	let send_threshold = 20_000;
	let block_monitor = false;
	let watched_txs = true;
	ScenarioBuilder::new()
		.with_rpc_uri(ws.to_string())
		.with_chain_type(ChainType::Sub)
		.with_block_monitoring(block_monitor)
		.with_watched_txs(watched_txs)
		.with_send_threshold(send_threshold)
}

Good idea, implemented here: 9a665b6

michalkucharczyk · 2025-02-18T14:15:47Z

substrate/client/transaction-pool/tests/integration.rs

@@ -58,28 +51,40 @@ async fn send_future_and_then_ready_from_many_accounts() {
 	// Shared params.
 	let send_threshold = 20_000;
 	let ws = "ws://127.0.0.1:9933";


Can we get this URL from zn API? (if we update toml this can diverge).

cc: @pepoviola

Yes, done here: 9a665b6


michalkucharczyk · 2025-02-18T16:19:37Z

substrate/client/transaction-pool/tests/integration.rs

+	let _ = net.wait_collator_client("charlie").await.unwrap();
+	let ws = net.node_rpc_uri("charlie").unwrap();
+	let future_scenario_executor = default_zn_scenario_builder()
+		.with_rpc_uri(ws.clone())
 		.with_chain_type(ChainType::Sub)


should go to default too

right 🙈 will do

michalkucharczyk · 2025-02-18T16:57:20Z

...te/client/transaction-pool/tests/zombienet/network-specs/asset-hub-high-pool-limit-fatp.toml

+# single-state
+# fork-aware


Suggested change

# single-state

# fork-aware

michalkucharczyk · 2025-02-18T16:57:53Z

...ansaction-pool/tests/zombienet/network-specs/asset-hub-high-pool-limit-oldp-3-collators.toml

+# single-state
+# fork-aware


Suggested change

# single-state

# fork-aware

michalkucharczyk · 2025-02-18T17:01:43Z

...te/client/transaction-pool/tests/zombienet/network-specs/asset-hub-high-pool-limit-fatp.toml

+validator = true
+
+[[relaychain.nodes]]
+name = "bob"


Maybe we should expose rpc port for bob too?

I think we can also enable fatxpool on the relaychain, so we would need to copy some args to the relaychain nodes.

Some scenarios sending to the relay chain would be nice. As discussed in Element, they can also be quicker for development (no need to wait for 10 blocks).

michalkucharczyk · 2025-02-18T17:02:01Z

...ate/client/transaction-pool/tests/zombienet/network-specs/asset-hub-low-pool-limit-fatp.toml

+# single-state
+# fork-aware


Suggested change

# single-state

# fork-aware

michalkucharczyk · 2025-02-18T17:07:25Z

substrate/client/transaction-pool/Cargo.toml

 array-bytes = { workspace = true, default-features = true }
 assert_matches = { workspace = true }
+async-trait = { workspace = true }
+chrono = { workspace = true }


is chrono used?

From what I can see, some of the other crates seem to be unused, too.

michalkucharczyk · 2025-02-18T17:12:17Z

substrate/client/transaction-pool/tests/zombienet/mod.rs

+pub const ASSET_HUB_LOW_POOL_LIMIT_FATP_SPEC_PATH: &'static str =
+	"tests/zombienet/network-specs/asset-hub-low-pool-limit-fatp.toml";
+pub const ASSET_HUB_HIGH_POOL_LIMIT_FATP_SPEC_PATH: &'static str =
+	"tests/zombienet/network-specs/asset-hub-high-pool-limit-fatp.toml";
+pub const ASSET_HUB_HIGH_POOL_LIMIT_OLDP_3_COLLATORS_SPEC_PATH: &'static str =
+	"tests/zombienet/network-specs/asset-hub-high-pool-limit-oldp-3-collators.toml";
+pub const ASSET_HUB_HIGH_POOL_LIMIT_OLDP_4_COLLATORS_SPEC_PATH: &'static str =
+	"tests/zombienet/network-specs/asset-hub-high-pool-limit-oldp-4-collators.toml";


nit: I would try to shorten the toml paths names using mod, e.g.:

Suggested change

-pub const ASSET_HUB_LOW_POOL_LIMIT_FATP_SPEC_PATH: &'static str =
-	"tests/zombienet/network-specs/asset-hub-low-pool-limit-fatp.toml";
-pub const ASSET_HUB_HIGH_POOL_LIMIT_FATP_SPEC_PATH: &'static str =
-	"tests/zombienet/network-specs/asset-hub-high-pool-limit-fatp.toml";
-pub const ASSET_HUB_HIGH_POOL_LIMIT_OLDP_3_COLLATORS_SPEC_PATH: &'static str =
-	"tests/zombienet/network-specs/asset-hub-high-pool-limit-oldp-3-collators.toml";
-pub const ASSET_HUB_HIGH_POOL_LIMIT_OLDP_4_COLLATORS_SPEC_PATH: &'static str =
-	"tests/zombienet/network-specs/asset-hub-high-pool-limit-oldp-4-collators.toml";
+pub mod asset_hub_based_spec_paths {
+	pub const LOW_POOL_LIMIT_FATP: &'static str =
+		"tests/zombienet/network-specs/asset-hub-low-pool-limit-fatp.toml";
+	pub const HIGH_POOL_LIMIT_FATP: &'static str =
+		"tests/zombienet/network-specs/asset-hub-high-pool-limit-fatp.toml";
+	pub const HIGH_POOL_LIMIT_OLDP_3_COLLATORS: &'static str =
+		"tests/zombienet/network-specs/asset-hub-high-pool-limit-oldp-3-collators.toml";
+	pub const HIGH_POOL_LIMIT_OLDP_4_COLLATORS: &'static str =
+		"tests/zombienet/network-specs/asset-hub-high-pool-limit-oldp-4-collators.toml";
+}

Shall be nicer to read when used in tests.

I would also use sstp (single-state-transaction-pool) instead of oldp.
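To illustrate how the module-scoped constants read at a call site, here is a small sketch. The module and constant names are taken from the suggestion above (two of the four paths are reproduced); `spec_file_name` is a hypothetical helper added only so the example has something to call.

```rust
use std::path::Path;

pub mod asset_hub_based_spec_paths {
    pub const LOW_POOL_LIMIT_FATP: &str =
        "tests/zombienet/network-specs/asset-hub-low-pool-limit-fatp.toml";
    pub const HIGH_POOL_LIMIT_FATP: &str =
        "tests/zombienet/network-specs/asset-hub-high-pool-limit-fatp.toml";
}

/// Hypothetical helper: extracts the file name of a spec path, e.g. for logging
/// which network spec a test is about to spawn.
fn spec_file_name(spec_path: &str) -> Option<&str> {
    Path::new(spec_path).file_name()?.to_str()
}
```

A test can then import the module once and refer to `asset_hub_based_spec_paths::LOW_POOL_LIMIT_FATP`, which is shorter than the original `ASSET_HUB_LOW_POOL_LIMIT_FATP_SPEC_PATH` when several specs from the same family are used.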

michalkucharczyk · 2025-02-18T17:15:50Z

substrate/client/transaction-pool/tests/zombienet/mod.rs

+	}
+
+	/// Returns a node client and waits for block production to kick off.
+	pub async fn wait_collator_client(


collator is redundant here, we can wait for any node.
I would change the name to indicate that we are waiting for node to start producing blocks, maybe:
wait_for_block_production ?

michalkucharczyk · 2025-02-18T17:23:47Z

substrate/client/transaction-pool/tests/zombienet/mod.rs

+			.subscribe_best()
+			.await
+			.map_err(|_| Error::FailedToGetBlocksStream)?;
+		loop {


We should wait for the block with number 1; otherwise we'll exit the loop on the genesis hash.
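The point about skipping genesis can be sketched independently of the actual client API. The example below is a simplified, synchronous stand-in: `BlockInfo` and `wait_for_first_produced_block` are hypothetical names, and a plain iterator replaces the best-block subscription used in the PR.

```rust
/// Hypothetical, simplified stand-in for a best-block notification.
#[derive(Debug, Clone, PartialEq)]
struct BlockInfo {
    number: u64,
    hash: String,
}

/// Consumes notifications until a block with number >= 1 shows up.
/// Genesis (number 0) is skipped, so the function does not return before
/// the node has actually produced a block. Returns `None` if the stream
/// ends without any produced block.
fn wait_for_first_produced_block<I>(blocks: I) -> Option<BlockInfo>
where
    I: IntoIterator<Item = BlockInfo>,
{
    blocks.into_iter().find(|block| block.number >= 1)
}
```

In the real async code the same condition would presumably sit inside the `loop` over the subscription, returning only once the observed block number is at least 1.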

michalkucharczyk · 2025-02-18T17:41:46Z

substrate/client/transaction-pool/tests/integration.rs

+	let (future_logs, ready_logs) = futures::future::join(
+		future_scenario_executor.execute(),
+		ready_scenario_executor.execute(),


One more thing which would be very useful is adding a prefix to debug logs, allowing us to distinguish instances of ttxt.

I think that tracing::span could be used for this, but I don't have enough experience with them to judge if it is doable.

iulianbarbu added 5 commits January 16, 2025 18:10

wip: impl parachain network with zn-sdk

0283ab9

wip adding more stuff to the nodes to catch all network

67c2ee9

made network primitives more flexible

7afefe1

added todos for what's wip

6d10819

added network base_dir & the rest of the testing networks

0e084ab

iulianbarbu added the R0-silent Changes should not be mentioned in any release notes label Jan 20, 2025

iulianbarbu requested a review from michalkucharczyk January 20, 2025 11:23

iulianbarbu self-assigned this Jan 20, 2025

iulianbarbu changed the title ~~Ib zn test fatp~~ fork-aware-tx-pool: add heavy load tests based on zombienet Jan 20, 2025

iulianbarbu mentioned this pull request Jan 20, 2025

fatxpool: add heavy load testsuits #5497

Open

lexnv reviewed Jan 20, 2025

substrate/client/transaction-pool/tests/zombienet/small_network_yap.rs

iulianbarbu added 3 commits January 20, 2025 22:22

renames and added builder support

daba086

added todos

f73874a

fixed derive_builder erros

21fda5f

michalkucharczyk reviewed Jan 21, 2025

substrate/client/transaction-pool/tests/zombienet/limits_30.rs

michalkucharczyk reviewed Jan 21, 2025

substrate/client/transaction-pool/tests/zombienet/limits_30.rs

pepoviola reviewed Jan 21, 2025


iulianbarbu added 2 commits January 23, 2025 09:45

wip

ef9b658

wip: load zn tomls with zn-sdk

90ab08d

wip: add genesis config patches and path to runtime in zn tomls

94b8891

wip

136b95a

fix nonce incrementing

f275ba3

michalkucharczyk reviewed Feb 4, 2025

substrate/client/transaction-pool/tests/integration.rs

iulianbarbu added 4 commits February 11, 2025 22:49

created test scenario

8de7754

updated assertions

e59e4bf

merge with master

2507d2c

updated ttxt usage

032e631

michalkucharczyk reviewed Feb 14, 2025

substrate/client/transaction-pool/tests/integration.rs

michalkucharczyk reviewed Feb 14, 2025

substrate/client/transaction-pool/tests/zombienet/mod.rs