
Add raw batch data wrapper query #77

Open · harisang wants to merge 4 commits into main

Conversation

@harisang (Contributor) commented Dec 2, 2024:

This PR introduces query 4351957, which aims to provide a wrapper for all raw batch data tables we might upload to Dune, for all chains we operate on.

It unifies the tables that already exist, and also finally introduces the auction id and environment columns for each batch/auction.

The plan is that this query will be updated in an automated fashion whenever a new table is introduced (at the beginning of each month).
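
(For illustration: a minimal sketch of the wrapper's shape, assuming the monthly table naming pattern shown in the diff below. Only the columns quoted in this PR are listed, and the 2024_11 table stands in for a hypothetical next month.)

    -- sketch: one select per monthly upload, unioned together,
    -- exposing auction_id and environment alongside the raw columns
    select environment, auction_id, capped_payment, winning_score, reference_score
    from dune.cowprotocol.dataset_batch_data_{{blockchain}}_2024_10
    union all
    select environment, auction_id, capped_payment, winning_score, reference_score
    from dune.cowprotocol.dataset_batch_data_{{blockchain}}_2024_11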

@fleupold (Contributor) left a comment:

I don't think the folder structure is a good choice. This seems strictly accounting-related, so it should go under cowprotocol/accounting.

cast(capped_payment as decimal(38, 0)) as capped_payment,
cast(winning_score as decimal(38, 0)) as winning_score,
cast(reference_score as decimal(38, 0)) as reference_score
from dune.cowprotocol.dataset_batch_data_{{blockchain}}_2024_10
Contributor commented:

Is the plan to add these 15 lines manually every month? Can we not come up with a better solution where this is automatically generated based on the current date?

If we go down this route, I'd at least like to see this query rewritten in a way that the redundant part becomes just

    union all
    select * from <new month table>

Otherwise this file will be a horror in 6 months from now.
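
(A sketch of that suggested shape, hoisting the casts from the diff above into a single final select so that each new month adds only two lines; the 2024_11 table is illustrative:)

    with raw_batch_data as (
        select * from dune.cowprotocol.dataset_batch_data_{{blockchain}}_2024_10
        union all
        select * from dune.cowprotocol.dataset_batch_data_{{blockchain}}_2024_11
    )
    -- casts applied once here, rather than repeated per monthly table
    select
        cast(capped_payment as decimal(38, 0)) as capped_payment,
        cast(winning_score as decimal(38, 0)) as winning_score,
        cast(reference_score as decimal(38, 0)) as reference_score
    from raw_batch_data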

@harisang (Contributor, Author) commented Dec 3, 2024:

The goal is that no one touches this query and that it is updated automatically by the script that uploads data to Dune.

Indeed, ideally we should have a select *. These tables were uploaded using dune-sync-v1; if we are able to do proper type casting with dune-sync-v2, the explicit casts could go away.

Contributor commented:

> and this is updated automatically by the script that uploads data on Dune.

How would this work specifically? Would the script make a pull request to this repository and automatically merge a change to the query on the first of each month?

Let's please think this process through to avoid a bad surprise 4 weeks from now when we think we are "done" with this project.

@harisang (Contributor, Author) commented:

Ok, good point about how to sync with this repo; I hadn't thought about this. What I have in mind is basically what Bram did when testing some versions of the sync in the Prefect repo: https://github.com/cowprotocol/Prefect/blob/a233d2831e936aa05ff4a7984aa1116580402e11/config/tasks/dune_sync.py#L121

Basically, since the dune-sync job takes a timestamp in order to work, if the timestamp says that it is the first day of the month, the job would update the query automatically by pushing a change directly to Dune. This means that the version of the query in this repo would get outdated, unless we also allow the script to push directly to main; but I am not sure that is necessary, since, again, this query is not meant to be actively maintained by anyone.

@harisang (Contributor, Author) commented Dec 4, 2024:

An alternative that I tried was to prefill the query with all tables for the next few years (!). But Dune complains, as expected, about non-existent tables. I am not sure whether there is a workaround for that (we could, for example, create dummy tables for the next 48 months, but I am not sure we want that).

Contributor commented:

I think having this be an automated job makes sense (though it would likely still benefit from having the only redundant part be select * from table_x rather than all the select statements). Therefore, I believe this query should not live here.
