Parquet error: Invalid offset in sparse column chunk data #8092
Comments
This should have been fixed by apache/arrow-rs#5036. Perhaps you could try out #8029?
FWIW we have released arrow 48.0.1 (see apache/arrow-rs#5050) so a
Please let us know if that doesn't work.
That is a good point. The workaround suggested by @liukun4515 can be implemented like this:
SQL
❯ set datafusion.execution.parquet.enable_page_index = false;
0 rows in set. Query took 0.000 seconds.
Programmatically
By setting
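For reference, a minimal sketch of applying the same workaround from Rust. It assumes a DataFusion version in which `SessionConfig::set_bool` and `SessionContext::new_with_config` are available (older releases used a differently named constructor); the helper name `context_without_page_index` is hypothetical. The only detail taken from this thread is the configuration key.

```rust
use datafusion::prelude::{SessionConfig, SessionContext};

/// Hypothetical helper: builds a context with the Parquet page index disabled,
/// mirroring `set datafusion.execution.parquet.enable_page_index = false;`.
fn context_without_page_index() -> SessionContext {
    let config = SessionConfig::new()
        // Same key as the SQL `set` statement above.
        .set_bool("datafusion.execution.parquet.enable_page_index", false);
    // Assumption: `new_with_config` exists in the DataFusion version in use.
    SessionContext::new_with_config(config)
}

fn main() {
    // Parquet tables registered on this context are scanned without page-index pruning.
    let _ctx = context_without_page_index();
}
```

Disabling the page index only sidesteps the failing code path and may reduce pruning efficiency, so upgrading to a release that includes the arrow-rs fix is likely the better long-term option.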
Describe the bug
Querying a registered Parquet table sometimes throws an error. The query is a range query over multiple fields. The same query with different literals, in which only one range affects the output because the others are unbounded, works fine. This happens with both SQL and the DataFrame API.
To Reproduce
No response
Expected behavior
Query returns result.
Additional context
No response