Tracking issue: Improve parachain liveliness observability #196

sandreim · 2023-01-06T13:23:07Z

The Problem

We currently rely only on node side metrics to observe the liveliness of the network. Mainly, we look at things like parachain block times, approval checking, disputes and finality lag which we are able to determine only if we have access to metrics scraped from validators and collators. This works really nice for test networks where we manage both the validators and collators. We want to build some tooling to enable additional observability via RPC . This would not obsolete any of the node metrics, as we would still rely on those when debugging.

Plan

We need to implement tracking of all parachains in the parachain commander which is currently limited to only one parachain. This full tracking should be the deafult mode for running the tool in Prometheus mode while CLI remains unchanged to tracking one parachain.

Metrics

The following metrics will be computed from the parchain inherent data from each relay chain block:

parachain block times (measure in relay chain blocks)
relay chain block times (via inherent timestamps)
availability health (bitfield count and 1 bits when core occupied)
backing
DMP/UMP/HRMP throughput
dispute initiation and conclusion times
dispute initiation per validator (via address label)

Deployment

We want to deploy this for both prod and test networks: Kusama, Polkadot, Westend, Rococo and Versi.
Additionally to the implementation we need to create Grafana dashboards that will be available in the polkadot-introspector repo.

Milestones

all metrics implemented
dashboards created and published
deployed on Polkadot/Kusama/Westend/Rococo
alerting and paging configured

Project tracking board

https://github.com/orgs/paritytech/projects/70/views/1

The text was updated successfully, but these errors were encountered:

sandreim added T4-Parachains_Engineering meta labels Jan 6, 2023

sandreim added this to Parachains-core Jan 6, 2023

github-project-automation bot moved this to To do in Parachains-core Jan 6, 2023

sandreim moved this from To do to In progress in Parachains-core Jan 6, 2023

AndreiEres added this to Polkadot Introspection Jun 8, 2023

github-project-automation bot moved this to Inbox in Polkadot Introspection Jun 8, 2023

AndreiEres moved this from Inbox to Todo in Polkadot Introspection Jun 8, 2023

the-right-joyce added this to parachains team board Oct 12, 2023

the-right-joyce moved this to In Progress in parachains team board Oct 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking issue: Improve parachain liveliness observability #196

Tracking issue: Improve parachain liveliness observability #196

sandreim commented Jan 6, 2023 •

edited

Loading

Tracking issue: Improve parachain liveliness observability #196

Tracking issue: Improve parachain liveliness observability #196

Comments

sandreim commented Jan 6, 2023 • edited Loading

The Problem

Plan

Metrics

Deployment

Milestones

Project tracking board

sandreim commented Jan 6, 2023 •

edited

Loading