Incorrect stats getting populated for a Decimal column while converting DeltaTable to IcebergTable using XTable #608
Comments
@Prajwaltr011 can you attach the |
Sure @the-other-tim-brown. Please find the Delta log files attached. One important thing to note is that we ran the "explain analyze" command on these tables using Trino, which calculated Trino meta stats. I am not sure whether this has any impact on how the Iceberg stats are populated. 00000000000000000000.json |
Thanks @Prajwaltr011 I think there is some issue in how we create the intermediate representation of column stats data from the stats json in Delta Lake. I am working on some improvements to our integration tests to see if I can reproduce this. |
I hit some other issues while setting up the integration test but have confirmed that the Iceberg stats are stored by calling |
Thank you for catching this! Hope to see the fix soon. |
I found the issue can also happen for Hudi sources so patched that as well here: #617 |
Fantastic work!! @the-other-tim-brown |
Search before asking
Please describe the bug 🐞
Recently, we used the XTable jar to convert Delta tables in our database to Iceberg tables. The conversion succeeded, but we found discrepancies when reading the statistics of the resulting Iceberg table. Specifically, for decimal columns, the upper and lower bounds in the Iceberg Avro snapshot files are incorrect compared to those of native Iceberg tables. For instance, if the minimum and maximum of a decimal column are -8.0 and -5.0, the stats show 0.8 and 0.5, which is completely incorrect.
call_center.zip
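For reference when checking the bounds, here is a minimal Java sketch of how a decimal bound should round-trip. The Iceberg spec stores a decimal bound as the unscaled value in two's-complement big-endian bytes, with the scale taken from the column type. The class name `DecimalBoundSketch`, its helper methods, and the scale of 1 are assumptions made for illustration. This is not XTable's code, and the actual precision and scale of the affected column are not stated in this thread.

```java
import java.math.BigDecimal;
import java.math.BigInteger;
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of XTable or Iceberg.
public class DecimalBoundSketch {

    // Encode a decimal bound as its unscaled value in two's-complement,
    // big-endian bytes. setScale without a rounding mode throws
    // ArithmeticException if the value does not fit the scale exactly.
    public static byte[] encode(BigDecimal value, int scale) {
        return value.setScale(scale).unscaledValue().toByteArray();
    }

    // Decode the bytes back into a decimal using the column's scale.
    // BigInteger(byte[]) interprets the bytes as signed two's complement,
    // so the sign of a negative bound is preserved.
    public static BigDecimal decode(byte[] bytes, int scale) {
        return new BigDecimal(new BigInteger(bytes), scale);
    }

    public static void main(String[] args) {
        int scale = 1; // assumed scale; the real column scale may differ
        for (String bound : new String[] {"-8.0", "-5.0"}) {
            BigDecimal original = new BigDecimal(bound);
            byte[] bytes = encode(original, scale);
            BigDecimal roundTripped = decode(bytes, scale);
            System.out.println(
                original + " -> " + Arrays.toString(bytes) + " -> " + roundTripped);
        }
    }
}
```

With this encoding, -8.0 and -5.0 come back unchanged. A bound that reads as 0.8 or 0.5 suggests the sign or the scale was lost somewhere before the value was written.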
Are you willing to submit PR?
Code of Conduct