AppView for aggregating block events with Pindexer #4752

Closed
ejmg wants to merge 7 commits

Conversation

@ejmg (Collaborator) commented Jul 24, 2024

Describe your changes

This PR provides an AppView for aggregating block events using Pindexer.

This PR is a complete 180° from my original approach. Initially I was going to fan out a series of AppViews, one for every event type relevant to block events, but I ended up with a wide filter on the indexer that dispatches based on whether or not the event has a transaction id. From there, the handler function handle_block_event inserts the new block event.
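Roughly, the dispatch looks like the sketch below. This is a minimal, self-contained illustration only: the ContextualizedEvent struct, the function signatures, and the block_events table layout are stand-ins I made up for this description, not the actual cometindex/pindexer types.

```rust
use std::collections::BTreeMap;

/// Minimal stand-in for an event with block context (not the real pindexer type).
struct ContextualizedEvent {
    block_height: u64,
    /// `Some(..)` when the event was emitted by a transaction, `None` for block-level events.
    tx_hash: Option<[u8; 32]>,
    kind: String,
    attributes: BTreeMap<String, String>,
}

/// Hypothetical upsert run once per block-level event: append the event, keyed
/// by its proto type, to the JSON array stored for that height.
const INSERT_BLOCK_EVENT: &str = "
    INSERT INTO block_events (height, events)
    VALUES ($1, jsonb_build_array($2::jsonb))
    ON CONFLICT (height)
    DO UPDATE SET events = block_events.events || EXCLUDED.events
";

/// Wide filter: look at every event, but only dispatch to the block handler
/// when the event carries no transaction id.
fn index_event(event: &ContextualizedEvent) {
    if event.tx_hash.is_none() {
        handle_block_event(event);
    }
}

/// Build the `{ $EventType: $EventJson }` payload; here we just print the SQL
/// and its parameters instead of executing them against a database.
fn handle_block_event(event: &ContextualizedEvent) {
    let mut obj = serde_json::Map::new();
    obj.insert(
        event.kind.clone(),
        serde_json::to_value(&event.attributes).expect("string map serializes"),
    );
    let payload = serde_json::Value::Object(obj);
    println!("{INSERT_BLOCK_EVENT} -- $1 = {}, $2 = {payload}", event.block_height);
}

fn main() {
    index_event(&ContextualizedEvent {
        block_height: 100,
        tx_hash: None,
        kind: "penumbra.core.component.sct.v1.EventBlockRoot".to_string(),
        attributes: BTreeMap::new(),
    });
}
```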

Besides any concerns with the design, known issues/concerns:

  • comprehensive coverage of all known block_event types. The current set checked is drawn from my own archives of older testnets.
  • design: JSON operations aren't free, but in the case of a block explorer this form allows a single, trivial query to pull all data efficiently from the AppView (SELECT * FROM block_events WHERE height=$1; see the sketch after this list).
  • I realized that I have no clue where block as an event type originates. I couldn't find the emitting code in the Penumbra codebase (especially the protos crate), so I assumed it's a pseudo-event of some sort. If I'm wrong, that needs fixing.
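The consumer side of that query could look roughly like the following; the connection string, the block_events(height BIGINT PRIMARY KEY, events JSONB) layout, and the use of sqlx (with its json feature) are assumptions for illustration, not the actual Cuiloa code:

```rust
use sqlx::{postgres::PgPoolOptions, Row};

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    // Hypothetical connection string and schema.
    let pool = PgPoolOptions::new()
        .connect("postgres://localhost/pindexer")
        .await?;

    // The single, trivial query from the bullet above: one row per block,
    // with every block event already aggregated into one JSON array.
    let row = sqlx::query("SELECT events FROM block_events WHERE height = $1")
        .bind(100_i64)
        .fetch_one(&pool)
        .await?;
    let events: serde_json::Value = row.try_get("events")?;
    println!("{events}");
    Ok(())
}
```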

Issue ticket number and link

Checklist before requesting a review

  • If this code contains consensus-breaking changes, I have added the "consensus-breaking" label. Otherwise, I declare my belief that there are not consensus-breaking changes, for the following reason:

This PR only adds an AppView to those already provided by the pindexer crate.

@ejmg added the A-indexing (Area: Relates to event indexing) label on Jul 24, 2024
@ejmg (Collaborator, Author) commented Jul 24, 2024

I didn't get a reply on the indexer channel, so apologies for going ahead with this approach in the end.

That said, if this design is considered OK, I plan on doing the same with transactions.

If not, then my best idea for a different design is to create a series of modules (like those @hdevalence wrote for staking, delegation, etc.) that each aggregate one relevant event type, leaving it up to the consuming app/client to join over all of them. Alternatively, using the raw indexer might not be too bad of a choice? Querying info specific to a single block is currently the simplest query in Cuiloa and relies on only a single CTE for aggregating transactions.
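For comparison, the raw-indexer query I have in mind is along these lines; the table and view names follow my recollection of the CometBFT psql indexer schema, so treat this as an approximation rather than the exact query Cuiloa uses:

```rust
/// Approximate raw-indexer query: a small CTE aggregates a block's transaction
/// hashes, then the block-level events for that height are joined in.
/// Table/view names (blocks, tx_results, block_events) are assumptions.
const RAW_BLOCK_QUERY: &str = "
    WITH txs AS (
        SELECT block_id, array_agg(tx_hash) AS tx_hashes
        FROM tx_results
        GROUP BY block_id
    )
    SELECT b.height, be.type, be.key, be.value, txs.tx_hashes
    FROM blocks b
    JOIN block_events be ON be.block_id = b.rowid
    LEFT JOIN txs ON txs.block_id = b.rowid
    WHERE b.height = $1;
";

fn main() {
    println!("{RAW_BLOCK_QUERY}");
}
```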

@ejmg changed the title from "WIP AppView for aggregating block events with Pindexer" to "AppView for aggregating block events with Pindexer" on Jul 25, 2024
@cronokirby (Contributor)

How will this data be presented in the block explorer? Will we just spit out JSON blobs?

@ejmg (Collaborator, Author) commented Jul 25, 2024

Yep. The strategy is fanning out wide with is_relevant and aggregating all associated block events into a single JSON array of objects with the shape { $ProtoEventSchemaURI: $ProtoEventJsonString }. The key lets the consuming client (Cuiloa, etc.) dispatch to the correct protobuf type for decoding.
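On the consuming side, dispatching on that key could look roughly like this (a Rust sketch rather than actual client code; the payloads and type URLs below are made up for illustration):

```rust
use serde_json::Value;

fn main() -> anyhow::Result<()> {
    // A made-up value in the { $ProtoEventSchemaURI: $ProtoEventJsonString } shape.
    let events: Value = serde_json::from_str(
        r#"[
            { "penumbra.core.component.sct.v1.EventBlockRoot": { "height": "100" } },
            { "penumbra.core.component.fee.v1.EventBlockFees": { "swapped_fee_total": {} } }
        ]"#,
    )?;

    for entry in events.as_array().into_iter().flatten() {
        let Some(obj) = entry.as_object() else { continue };
        for (type_url, event) in obj {
            // Dispatch on the key to pick the matching protobuf type for decoding.
            match type_url.as_str() {
                "penumbra.core.component.sct.v1.EventBlockRoot" => {
                    println!("decode as EventBlockRoot: {event}");
                }
                other => println!("unhandled event type {other}: {event}"),
            }
        }
    }
    Ok(())
}
```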

@cronokirby (Contributor) commented Jul 25, 2024

What if we had a static table layout instead, with one table for each kind of event we want to display?

Basically, I'm not quite sure what the benefit of having this table is over reading the raw indexing database?

Also, storing an array of JSON objects in a column is not what we want, I think; instead we would want multiple events as different rows.

@cronokirby (Contributor)

For example, instead of recording that BlockRoot was one of the events in the block, we want to use that information to record that the block had a particular root hash. Similarly, for fees, we don't want to record them generically; we want to record that this block had particular fees, etc.
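Concretely, I'm imagining something like the following layout; the table and column names here are only illustrative:

```rust
/// Illustrative "one table per event kind" layout; names and columns are
/// hypothetical, not an actual proposed schema.
const STATIC_SCHEMA: &str = "
    -- from the block root event: one row per block, with its root hash
    CREATE TABLE IF NOT EXISTS block_details (
        height BIGINT PRIMARY KEY,
        root   BYTEA  NOT NULL
    );

    -- from the block fee event: one row per block, with its total fees
    CREATE TABLE IF NOT EXISTS block_fees (
        height    BIGINT PRIMARY KEY,
        total_fee BIGINT NOT NULL
    );
";

fn main() {
    println!("{STATIC_SCHEMA}");
}
```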

@ejmg (Collaborator, Author) commented Jul 25, 2024

Basically, I'm not quite sure what the benefit of having this table is over reading the raw indexing database?

This, more than anything, is something I was thinking about the entire time. I don't know if you have looked closely at the raw indexer query I use for pulling in block info on Cuiloa, but I achieve the same thing in roughly 20 lines of SQL, and that includes a small CTE used for aggregating any related transactions. block_events is a view, so it hides an additional join, but nothing else.

The reason I still went ahead with writing this table was to see how it would perform in practice and to give the consuming client (Cuiloa) an extremely simple API to consume. Using the same design to aggregate transaction hashes associated with a block_id into a block_txs table would basically allow a three-liner of SQL for grabbing all events and transaction hashes for a given height (sketched below). It's really convenient for the consumer; that's the biggest and only real argument I have for it.
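To make that concrete, the companion table and the consumer query I have in mind look roughly like this (all names hypothetical):

```rust
/// Hypothetical block_txs companion table plus the "three-liner" consumer
/// query: all block events and all transaction hashes for a given height.
const BLOCK_TXS_DDL: &str = "
    CREATE TABLE IF NOT EXISTS block_txs (
        height  BIGINT NOT NULL,
        tx_hash BYTEA  NOT NULL
    );
";

const BLOCK_AT_HEIGHT: &str = "
    SELECT e.events, array_agg(t.tx_hash) AS tx_hashes
    FROM block_events e
    LEFT JOIN block_txs t ON t.height = e.height
    WHERE e.height = $1
    GROUP BY e.height, e.events;
";

fn main() {
    println!("{BLOCK_TXS_DDL}\n{BLOCK_AT_HEIGHT}");
}
```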

Also, storing an array of JSON objects in a column is not what we want, I think; instead we would want multiple events as different rows.

I wrote this with only Cuiloa in mind, not as a custom indexer for others to consume, per se, with use cases that differ from Cuiloa's. That said, I'm not making a principled argument that my approach is good.

What if we had a static table layout instead, with one table for each kind of event we want to display?
...
For example, instead of recording that BlockRoot was one of the events in the block, we want to use that information to record that the block had a particular root hash. Similarly, for fees, we don't want to record them generically; we want to record that this block had particular fees, etc.

These are the questions I started with and the solution I was originally working out. This PR ended up going to the very opposite end of the spectrum by shoving every event into a single column. I'm not opposed to spinning out a bunch of custom indexers for each event type, but I'd need a list of all ABCI events that are valid block events. It would also mean that a consuming application needs to perform at least 18 joins to comprehensively aggregate all possible block events; for transactions, that would be at least 25 joins. Is there a way we could simplify that, or is that an acceptable trade-off?
