Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ETL superset usage metrics #179

Open
1 task
bendnorman opened this issue Sep 18, 2024 · 1 comment
Open
1 task

ETL superset usage metrics #179

bendnorman opened this issue Sep 18, 2024 · 1 comment
Labels
superset Relating to Superset usage metrics

Comments

@bendnorman
Copy link
Member

ETL Superset logs so we can understand who is using it and how!

There are a few sources of superset information

  1. the superset database and
  2. the Cloud logs produced by Cloud Run.
  3. the Cloud Monitoring Metrics of the Cloud Run instance.

I think the database will provide valuable information like users, queries and logs. This comment has some more information. The Cloud Logs will likely have everything in the logs table plus some additional Cloud Run operational information which could be helpful for monitoring the performance of specific operations. The Monitoring metrics information would be helpful for monitoring overall performance of the deployment. Monitoring metrics are currently retained for 6 weeks so we'd need to archive them.

I could see a few different phases for this work:

Phase I

  • Connect the superset db to superset and build a few simple dashboards off the raw tables.

Phase II

  • Set up a Cloud Log Sink for Cloud run so we can archive the logs Create a Cloud Logging sink for superset logs #178
  • Archive the Cloud monitoring metrics some how
  • If we feel like if the superset database tables require additional processing we could add it to the ETL

Phase II

  • Incorporate the Cloud Logs and Monitoring metrics into our ETL and dashboard if we think they are useful

Next steps

@bendnorman bendnorman added the superset Relating to Superset usage metrics label Sep 18, 2024
@bendnorman bendnorman moved this from New to Icebox in Catalyst Megaproject Sep 18, 2024
@bendnorman
Copy link
Member Author

Also, @e-belfer I connected the superset database to superset so you can explore the data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
superset Relating to Superset usage metrics
Projects
Status: Icebox
Development

No branches or pull requests

1 participant