[Spark] Propagate catalog table through DeltaSink #2109

ryan-johnson-databricks · 2023-09-26T21:58:01Z

Which Delta project/connector is this regarding?

Description

In order to implement #2052 for streaming writes, DeltaSink needs to track the catalog table, if any, so it can properly initialize the transactions it executes. We can't change the Spark DataSource API that creates the sink, so instead we add logic in DeltaAnalysis that extracts the catalog table from the WriteToStream and applies it to the underlying DeltaSink.

How was this patch tested?

New unit test.

Does this PR introduce any user-facing changes?

No.

spark/src/test/scala/org/apache/spark/sql/delta/DeltaSuite.scala

revert unwanted changes

ryan-johnson-databricks added 2 commits September 26, 2023 14:53

[Spark] Propagate catalog table through DeltaSink

204eb0c

fix delta source vs sink

618b6ea

tdas reviewed Sep 27, 2023

View reviewed changes

spark/src/test/scala/org/apache/spark/sql/delta/DeltaSuite.scala Outdated Show resolved Hide resolved

tdas reviewed Sep 27, 2023

View reviewed changes

spark/src/test/scala/org/apache/spark/sql/delta/DeltaSuite.scala Outdated Show resolved Hide resolved

add a real test

a4574d5

ryan-johnson-databricks requested a review from tdas September 27, 2023 14:09

Update DeltaSuite.scala

ff4a558

revert unwanted changes

scottsand-db approved these changes Sep 28, 2023

View reviewed changes

tdas approved these changes Sep 28, 2023

View reviewed changes

vkorukanti closed this in aa12854 Sep 28, 2023

ryan-johnson-databricks mentioned this pull request Oct 11, 2023

[Spark] Improves InvalidProtocolVersionException message #2118

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Spark] Propagate catalog table through DeltaSink #2109

[Spark] Propagate catalog table through DeltaSink #2109

ryan-johnson-databricks commented Sep 26, 2023

[Spark] Propagate catalog table through DeltaSink #2109

[Spark] Propagate catalog table through DeltaSink #2109

Conversation

ryan-johnson-databricks commented Sep 26, 2023

Which Delta project/connector is this regarding?

Description

How was this patch tested?

Does this PR introduce any user-facing changes?