Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add storage_kind tag #95

Merged
merged 3 commits into from
Jun 7, 2024
Merged

Add storage_kind tag #95

merged 3 commits into from
Jun 7, 2024

Conversation

izzye84
Copy link
Collaborator

@izzye84 izzye84 commented Jun 6, 2024

Adding storage_kind to non-dbt assets in the hooli_data_eng code location.

Summary of changes:

  • Added config_utils.py to dynamically generate storage_kind (duckdb or snowflake) based on deployment (local or branch/prod)
  • Added storage_kind to non-dbt assets where the compute_kind didn't make this obvious

One thing to note, I defaulted the storage_kind to S3 for assets using the model_io_manager since it uses the FilesystemIOManager() locally and S3 in branch and prod deployments.

Copy link

github-actions bot commented Jun 6, 2024

Your pull request at commit 9fc1bbf4818a8f6ceea85ef6c57d0b8ed2f77160 is automatically being deployed to Dagster Cloud.

Location Status Link Updated
demo_assets View in Cloud Jun 06, 2024 at 11:06 PM (UTC)
basics View in Cloud Jun 06, 2024 at 11:06 PM (UTC)
batch_enrichment View in Cloud Jun 06, 2024 at 11:06 PM (UTC)
data-eng-pipeline View in Cloud Jun 06, 2024 at 11:06 PM (UTC)
snowflake_insights View in Cloud Jun 06, 2024 at 11:06 PM (UTC)

@cnolanminich
Copy link
Contributor

this is awesome!

One question -- is the goal of this PR that all of these assets (see below) have storage kinds? Also, I noticed that Sling doesn't -- I'm guessing that is a change to our sling integration and not something we can implement on our side though

image

Copy link

github-actions bot commented Jun 7, 2024

Your pull request at commit b6e5aeacb51f35d8f725f86a2035aef56d052edd is automatically being deployed to Dagster Cloud.

Location Status Link Updated
demo_assets View in Cloud Jun 07, 2024 at 08:11 PM (UTC)
basics View in Cloud Jun 07, 2024 at 08:11 PM (UTC)
batch_enrichment View in Cloud Jun 07, 2024 at 08:11 PM (UTC)
data-eng-pipeline View in Cloud Jun 07, 2024 at 08:11 PM (UTC)
snowflake_insights View in Cloud Jun 07, 2024 at 08:11 PM (UTC)

@izzye84
Copy link
Collaborator Author

izzye84 commented Jun 7, 2024

this is awesome!

One question -- is the goal of this PR that all of these assets (see below) have storage kinds? Also, I noticed that Sling doesn't -- I'm guessing that is a change to our sling integration and not something we can implement on our side though

image

I updated all assets (with a couple exceptions) to include storage_kind since the compute_kind doesn't necessarily have to match the storage_kind. That said, dagstermill_assets (model_nb) aren't yet supported and I'm still unsure how we want to represent the k8s storage_kind.

Copy link
Contributor

@cnolanminich cnolanminich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lgtm!

Copy link

github-actions bot commented Jun 7, 2024

Your pull request at commit 7658c221a871f2cf951ccf5e3f258fc92a6ae3ec is automatically being deployed to Dagster Cloud.

Location Status Link Updated
demo_assets View in Cloud Jun 07, 2024 at 09:00 PM (UTC)
basics View in Cloud Jun 07, 2024 at 09:00 PM (UTC)
batch_enrichment View in Cloud Jun 07, 2024 at 09:00 PM (UTC)
data-eng-pipeline View in Cloud Jun 07, 2024 at 09:00 PM (UTC)
snowflake_insights View in Cloud Jun 07, 2024 at 09:00 PM (UTC)

@izzye84 izzye84 merged commit 09f5a45 into master Jun 7, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants