Upserting records to s3: .s3.merge_upsert_table vs overwrite partition parameter in s3 write functions #635
-
I would like to upsert records in a parquet table without overwriting the entire partition. Would the s3.merge_upsert_table function achieve this or is the s3.merge_upsert_table method just the same as using s3.to_parquet method with the parameter overwrite_partitions (Partition Upsert) enabled? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Thank you for discussing this. The current I have already covered this in this issue but to summarise once more:
|
Beta Was this translation helpful? Give feedback.
Thank you for discussing this. The current
s3.merge_upsert_table
method is very naive and lacks numerous features. As you have pointed out in your raised issues, loading the entire table into a data frame is inefficient and the method does not support partitioning.I have already covered this in this issue but to summarise once more: