Description
Allow passing ds.write_dataset kwarg overrides into the write_deltalake function (delta-rs/python/deltalake/writer.py, lines 327 to 344 in fcfd1bf).
Use Case
My specific use case is to override the basename_template param. When I'm bombarding S3 with thousands of concurrent tasks, reading the parquet file directly, without going through the txn log to find the files, yields performance benefits. Explicitly naming the parquet file lets me specify the filename statically and deterministically, as opposed to querying for it at runtime.
Furthermore, this would future-proof write_deltalake by exposing any additional kwargs or enhancements in the pyarrow datasets API.
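To make the idea concrete, here is a rough sketch of how such overrides could be merged and forwarded. It is not the actual delta-rs code: DEFAULT_WRITER_KWARGS, merge_writer_kwargs, write_with_overrides, the writer_kwargs parameter, and the S3 path are all hypothetical names chosen for illustration, and a stand-in function takes the place of ds.write_dataset so the snippet runs without pyarrow.

```python
from typing import Any, Callable, Dict, Optional

# Hypothetical defaults; the real arguments are set in
# delta-rs/python/deltalake/writer.py and may differ.
DEFAULT_WRITER_KWARGS: Dict[str, Any] = {
    "format": "parquet",
    "basename_template": "part-{i}.parquet",
}


def merge_writer_kwargs(
    defaults: Dict[str, Any], overrides: Optional[Dict[str, Any]] = None
) -> Dict[str, Any]:
    """Return a copy of defaults with caller-supplied overrides applied on top."""
    merged = dict(defaults)
    merged.update(overrides or {})
    return merged


def write_with_overrides(
    write_fn: Callable[..., Any],
    data: Any,
    base_dir: str,
    writer_kwargs: Optional[Dict[str, Any]] = None,
) -> Any:
    """Call write_fn(data, base_dir, **kwargs); entries in writer_kwargs win."""
    kwargs = merge_writer_kwargs(DEFAULT_WRITER_KWARGS, writer_kwargs)
    return write_fn(data, base_dir, **kwargs)


def fake_write_dataset(data: Any, base_dir: str, **kwargs: Any) -> Dict[str, Any]:
    """Stand-in for ds.write_dataset that only echoes the kwargs it received."""
    return kwargs


result = write_with_overrides(
    fake_write_dataset,
    data=[{"id": 1}],
    base_dir="s3://my-bucket/my-table",  # hypothetical location
    writer_kwargs={"basename_template": "task-0042-{i}.parquet"},
)
print(result["basename_template"])
```

With pyarrow installed, write_fn would be pyarrow.dataset.write_dataset, which accepts basename_template and requires the {i} token in it. Merging a dict over the defaults means any kwarg added to the pyarrow API later could be passed through without changing the wrapper's signature.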
I'm happy to propose the change and submit a PR if this is an acceptable enhancement.