feat(dbt): load dbt scaffold using `package_data` #15502

rexledesma · 2023-07-25T06:13:38Z

Summary & Motivation

Allow dagster-dbt project scaffold to load dbt projects from their path in package_data. This will be invoked in the dbt NUX.

We hardcode dbt-project as the dbt project directory name for the package_data. This restraint could be loosened in the future.

How I Tested These Changes

pytest
dbt NUX creates a pull request that builds to branch deployment: https://github.com/dagster-io/test-dagster-dbt-scaffold/pull/53
Github actions that build a Dagster package utilizing package_data chore: add steps to build Python package with dbt project dagster-cloud-action#147

rexledesma · 2023-07-25T06:13:50Z

Current dependencies on/for this PR:

master
- PR feat(dbt): load dbt scaffold using package_data #15502 👈

This comment was auto-generated by Graphite.

shalabhc

can a dbt scaffold work without including it in package data?
if not, this bool shouldn't be an option, we should always expect a subdir containing the dbt project

shalabhc · 2023-07-27T00:09:00Z

python_modules/libraries/dagster-dbt/dagster_dbt/include/setup.py.jinja

+    {% if use_dbt_project_package_data_dir -%}
+    package_data={
+        "{{ project_name }}": [
+            "dbt-project/*",


fyi not sure if you expected nested directories here but this might only do one level by default.

I did expect nested directories here, but looks like /* worked as I expected, since it built in cloud. But I'll use a recursive glob here, since that seems to be supported as well, and makes this behavior explicit: https://setuptools.pypa.io/en/latest/history.html#v62-3-0

shalabhc · 2023-07-27T00:19:35Z

python_modules/libraries/dagster-dbt/dagster_dbt/cli/app.py

@@ -89,6 +90,9 @@ def copy_scaffold(
        for target in profile["outputs"].values()
    ]

+    if use_dbt_project_package_data_dir:


I think this should just use the passed in dbt_project_dir and just error if the dbt_project_dir is not under the start path

rexledesma · 2023-07-27T00:34:45Z

can a dbt scaffold work without including it in package data? if not, this bool shouldn't be an option, we should always expect a subdir containing the dbt project

The scaffold can run locally without package data. However, it is required when you deploy. I was planning on adding documentation to explain the changes required to deploy a Dagster + dbt project in a Python package to cloud. But to shield the user from this complexity, by default we don't enable this in the open source CLI.

## Summary & Motivation Allow `dagster-dbt project scaffold` to load dbt projects from their path in `package_data`. This will be invoked in the dbt NUX. We hardcode `dbt-project` as the dbt project directory name for the `package_data`. This restraint could be loosened in the future. ## How I Tested These Changes - pytest - dbt NUX creates a pull request that builds to branch deployment: https://github.com/dagster-io/test-dagster-dbt-scaffold/pull/53 - Github actions that build a Dagster package utilizing `package_data` dagster-io/dagster-cloud-action#147

rexledesma force-pushed the rl/install-dbt-project-as-part-of-package branch 2 times, most recently from dbbb5c1 to 1f508a6 Compare July 26, 2023 00:23

rexledesma requested a review from sryza July 26, 2023 00:23

rexledesma self-assigned this Jul 26, 2023

rexledesma requested a review from shalabhc July 26, 2023 00:23

rexledesma marked this pull request as ready for review July 26, 2023 00:23

rexledesma force-pushed the rl/install-dbt-project-as-part-of-package branch from 1f508a6 to 509c470 Compare July 26, 2023 06:12

shalabhc approved these changes Jul 27, 2023

View reviewed changes

feat(dbt): load dbt scaffold using package_data

6c3d6b2

rexledesma force-pushed the rl/install-dbt-project-as-part-of-package branch from 509c470 to 6c3d6b2 Compare July 27, 2023 00:35

rexledesma merged commit 73b47a5 into master Jul 27, 2023

rexledesma deleted the rl/install-dbt-project-as-part-of-package branch July 27, 2023 00:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(dbt): load dbt scaffold using `package_data` #15502

feat(dbt): load dbt scaffold using `package_data` #15502

rexledesma commented Jul 25, 2023 •

edited

Loading

rexledesma commented Jul 25, 2023

shalabhc left a comment

shalabhc Jul 27, 2023

rexledesma Jul 27, 2023

shalabhc Jul 27, 2023

rexledesma commented Jul 27, 2023

feat(dbt): load dbt scaffold using package_data #15502

feat(dbt): load dbt scaffold using package_data #15502

Conversation

rexledesma commented Jul 25, 2023 • edited Loading

Summary & Motivation

How I Tested These Changes

rexledesma commented Jul 25, 2023

shalabhc left a comment

Choose a reason for hiding this comment

shalabhc Jul 27, 2023

Choose a reason for hiding this comment

rexledesma Jul 27, 2023

Choose a reason for hiding this comment

shalabhc Jul 27, 2023

Choose a reason for hiding this comment

rexledesma commented Jul 27, 2023

feat(dbt): load dbt scaffold using `package_data` #15502

feat(dbt): load dbt scaffold using `package_data` #15502

rexledesma commented Jul 25, 2023 •

edited

Loading