
How to pass a where clause predicate to rewrite_data_files that uses year of a timestamp column #11789

Open
salimpadela opened this issue Dec 14, 2024 · 1 comment
Labels
question Further information is requested

Comments

@salimpadela

salimpadela commented Dec 14, 2024

Query engine

How do I pass a predicate in the where clause of rewrite_data_files using PySpark? If it matters, I am using AWS Glue to execute this job.

Question

I can't seem to figure out what is wrong with the way I am passing the where clause predicate to rewrite_data_files.

```python
spark.sql("""
    CALL glue_catalog.system.rewrite_data_files(
        table => 'my-awesome-table',
        where => "col1 IN ('CT') AND col2 IN (5) AND year(CAST(col3 AS DATE)) IN (1990)",
        strategy => 'binpack',
        options => map('min-input-files', '2'))
""")
```

This fails with:

```
Error Category: UNCLASSIFIED_ERROR; Failed Line Number: 3550; IllegalArgumentException: Cannot translate Spark expression: ((col1#50421 INSET CT AND col2#50422 INSET 10) AND year(cast(col3#50424 as date)) INSET 1990) to data source filter
```

I also tried `AND year(col3) IN (1990)` in the where clause.

If I don't pass `AND year(CAST(col3 AS DATE)) IN (1990)` in the where clause, it works fine.
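For reference, a sketch of the variant the issue reports as working, i.e. the same CALL with the `year()` predicate dropped (table and column names are the ones from this issue; the statement is built as a string here and would be executed via `spark.sql(sql)` in a live session):

```python
# Sketch: the same rewrite_data_files CALL without the year() predicate,
# which the reporter says runs fine. Names come from the issue itself.
sql = """
CALL glue_catalog.system.rewrite_data_files(
  table => 'my-awesome-table',
  where => "col1 IN ('CT') AND col2 IN (5)",
  strategy => 'binpack',
  options => map('min-input-files', '2'))
"""
# In a live session: spark.sql(sql)
```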

What am I missing here?

@salimpadela salimpadela added the question Further information is requested label Dec 14, 2024
@manuzhang
Contributor
You may try `glue_catalog.system.years(ts)` (where `ts` is a `TIMESTAMP`; you may cast your column to it).
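An untested sketch of that suggestion, assuming the Iceberg Spark function catalog exposes `years()` under the catalog's `system` namespace; comparing against `years()` of a literal timestamp sidesteps having to know the transform's raw output value. Table and column names are taken from the issue; whether this predicate translates to a data source filter in your Iceberg/Glue versions still needs verifying:

```python
# Sketch (not definitive): use Iceberg's years() transform function in the
# where clause instead of Spark's year(), comparing transform to transform.
sql = """
CALL glue_catalog.system.rewrite_data_files(
  table => 'my-awesome-table',
  where => "col1 IN ('CT') AND col2 IN (5)
            AND glue_catalog.system.years(CAST(col3 AS TIMESTAMP))
              = glue_catalog.system.years(TIMESTAMP '1990-01-01')",
  strategy => 'binpack',
  options => map('min-input-files', '2'))
"""
# In a live session: spark.sql(sql)
```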
