use "direct" write for non-partitioned python model materializations #1388

colin-rogers-dbt · 2024-10-29T00:32:40Z

resolves #1318

To support partitioned materializations in BQ python models, we switched from "direct" to "indirect" mode when writing model results in Dataproc back to BigQuery. As the name implies, "indirect" temporarily stages data in the provided GCS bucket. If a user has a retention policy on the bucket, the write fails because the bucket won't allow Dataproc to delete these temp files as it goes.

This PR sidesteps that issue and preserves backwards compatibility with models created before 1.7 by using "direct" write when a partition config is not provided.
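The selection rule can be sketched in plain Python. This is an illustration only, not the Jinja change made in table.sql: `choose_write_method` and `build_writer_options` are hypothetical helpers, and the option keys `writeMethod` and `temporaryGcsBucket` are assumed to be the Spark BigQuery connector settings involved.

```python
from typing import Any, Dict, Mapping, Optional


def choose_write_method(partition_by: Optional[Mapping[str, Any]]) -> str:
    # Hypothetical helper: a partitioned model keeps the GCS-staged
    # "indirect" path; a missing or empty partition config uses "direct".
    return "indirect" if partition_by else "direct"


def build_writer_options(
    partition_by: Optional[Mapping[str, Any]] = None,
    gcs_bucket: Optional[str] = None,
) -> Dict[str, str]:
    # Assumed option names; how dbt-bigquery passes them is not shown here.
    method = choose_write_method(partition_by)
    options = {"writeMethod": method}
    if method == "indirect":
        # Only the indirect path stages temp files in the bucket, so only
        # this path can be affected by a bucket retention policy.
        if not gcs_bucket:
            raise ValueError("indirect writes need a staging bucket")
        options["temporaryGcsBucket"] = gcs_bucket
    return options
```

With this rule, a non-partitioned model never touches the staging bucket, which is why a retention policy no longer affects it.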

Note: I have not added any new testing to cover the case where a user has set a retention policy. Ultimately I think this is an edge case we don't need to test against, but as a follow-up we should document that a bucket retention policy cannot be used with a partitioned python model.

docs dbt-labs/docs.getdbt.com/#

Problem

Solution

Checklist

I have read the contributing guide and understand what's expected of me
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
This PR has no interface changes (e.g. macros, cli, logs, json artifacts, config files, adapter interface, etc) or this PR has already received feedback and approval from Product or DX

dbt/include/bigquery/macros/materializations/table.sql

VersusFacit

one nit

colin-rogers-dbt added 5 commits October 28, 2024 16:45

use dynamic schema in test_grant_access_to.py

c03918d

use dynamic schema in test_grant_access_to.py

e343185

revert setup

251cff1

use "direct" write for non-partitioned python model materializations

0605d14

add changie log

64a1e1a

colin-rogers-dbt self-assigned this Oct 29, 2024

colin-rogers-dbt requested a review from a team as a code owner October 29, 2024 00:32

cla-bot bot added the cla:yes label Oct 29, 2024

colin-rogers-dbt added 5 commits October 28, 2024 17:35

add code comment

ab84878

make code comment inline

12920c3

make code comment inline

79b5c68

remove code comment

a97abb1

use set write_method instead of inline conditional

ec96883

VersusFacit reviewed Oct 29, 2024

dbt/include/bigquery/macros/materializations/table.sql (outdated, resolved)

VersusFacit approved these changes Oct 29, 2024

use set write_method instead of inline conditional

f998349

colin-rogers-dbt merged commit a09a8fa into main Oct 29, 2024
42 checks passed

colin-rogers-dbt deleted the supportDirectWritesAgain branch October 29, 2024 21:41
