Add support to read iceberg partitioned data with non-identity transforms. #6438
Labels
feature request
New feature or request
iceberg
NoDocumentationNeeded
ReleaseNotesNeeded
Release notes are needed
For the identity transform partitioning case (the one we currently support), Iceberg tables have a stand-alone partitioning column that is part of the schema and carries its own values.
For the non-identity transform partitioning case, the partition value is derived from another column. For example, the data can contain a timestamp column, and the partition value can be the day part of that column.
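To make the difference concrete, here is a minimal sketch in plain Python (it uses neither pyiceberg nor our reader). It assumes the `day` transform as described in the Iceberg spec, where the partition value is the number of days since 1970-01-01. The names `identity_transform`, `day_transform`, `event_ts`, and the sample rows are hypothetical and only for illustration.

```python
from datetime import datetime, timezone

# Assumption: timestamps are timezone-aware and normalized to UTC.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def identity_transform(value):
    """Identity transform: the partition value is the column value itself."""
    return value


def day_transform(ts: datetime) -> int:
    """Day transform: whole days since 1970-01-01 (floored), derived from a timestamp."""
    return (ts - EPOCH).days


# Hypothetical rows: there is no stand-alone partition column in the data,
# only the source timestamp column that the partition value is derived from.
rows = [
    {"id": 1, "event_ts": datetime(2024, 1, 15, 8, 30, tzinfo=timezone.utc)},
    {"id": 2, "event_ts": datetime(2024, 1, 15, 23, 59, tzinfo=timezone.utc)},
    {"id": 3, "event_ts": datetime(2024, 1, 16, 0, 0, tzinfo=timezone.utc)},
]

# Group row ids by the derived partition value.
partitions = {}
for row in rows:
    partitions.setdefault(day_transform(row["event_ts"]), []).append(row["id"])

print(partitions)
```

Rows 1 and 2 land in the same partition because they share a calendar day, while row 3 starts a new one. A reader therefore cannot treat the partition value as a regular data column and has to account for the transform and its source column.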
As part of this issue, we should add support for reading non-identity transform partitioning columns properly.
Reference: https://iceberg.apache.org/spec/#partition-transforms
I have added a script in the comments to generate such a table using pyiceberg.