Support Time Parquet Data Page Statistics #11114
Comments
Can I work on this issue? Kindly assign it to me!
Hi @myeunee, you can just comment "take" and it will be automatically assigned to you.
Also in general, feel free to work on any issue -- https://datafusion.apache.org/contributor-guide/index.html#finding-and-creating-issues-to-work-on 🚀
It looks like this issue is one of the last needed to complete the data page statistics extraction feature 🤔
Thank you @dharanad |
Is your feature request related to a problem or challenge?
Part of #10922
We are adding APIs to efficiently convert the data stored in Parquet's "PageIndex" into `ArrayRef`s -- which will make it significantly easier to use this information for pruning and other tasks.
Describe the solution you'd like
Add support to `StatisticsConverter::min_page_statistics` and `StatisticsConverter::max_page_statistics` for the types above
datafusion/datafusion/core/src/datasource/physical_plan/parquet/statistics.rs
Lines 637 to 656 in a923c65
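To make the request concrete, here is a minimal standard-library sketch of what per-page min/max extraction and pruning mean for a time column. It is not the DataFusion implementation: `PageStats`, `min_page_values`, `max_page_values` and `can_skip_page` are hypothetical names invented for illustration, and the column is assumed to hold millisecond-precision times stored as `i32` milliseconds since midnight.

```rust
/// Hypothetical stand-in for the statistics of one data page of a
/// millisecond-precision time column (values are ms since midnight).
/// This is NOT a parquet-rs or DataFusion type.
#[derive(Debug, Clone, Copy)]
struct PageStats {
    min: Option<i32>,
    max: Option<i32>,
}

/// One entry per page; a page without statistics yields `None`,
/// which plays the role of a null slot in an Arrow array.
fn min_page_values(pages: &[PageStats]) -> Vec<Option<i32>> {
    pages.iter().map(|p| p.min).collect()
}

/// Same as `min_page_values`, but for the per-page maximums.
fn max_page_values(pages: &[PageStats]) -> Vec<Option<i32>> {
    pages.iter().map(|p| p.max).collect()
}

/// For a predicate `col > threshold`, a page can be skipped only when
/// its maximum is known and does not exceed the threshold.
fn can_skip_page(max: Option<i32>, threshold: i32) -> bool {
    matches!(max, Some(m) if m <= threshold)
}
```

A page whose statistics are missing is never skipped, so pruning stays conservative when the page index is incomplete.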
Describe alternatives you've considered
You can follow the model from @Weijun-H in #10931
`Check::Both`, following the model of `test_int64`:
datafusion/datafusion/core/tests/parquet/arrow_statistics.rs
Lines 506 to 529 in a923c65
`get_datapage_statistics`:
datafusion/datafusion/core/src/datasource/physical_plan/parquet/statistics.rs
Line 624 in 459afbb
datafusion/datafusion/core/src/datasource/physical_plan/parquet/statistics.rs
Line 90 in 2f43476
Typically the change to the test looks like
Additional context
No response