Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: Enable Parquet E2E filter tests on decimal type #11602

Closed
wants to merge 2 commits into from

Conversation

rui-mo
Copy link
Collaborator

@rui-mo rui-mo commented Nov 20, 2024

  1. Fixes 'BatchMaker::createVectorTypeKind::BIGINT' to ensure decimal vector
    can be generated.
  2. Fixes 'testRowGroupSkip' for short decimal. Clarifies that row group skip based
    on column statistics for decimal type is not currently supported.
  3. Supports 'testInt128' in IsNull filter.

@rui-mo rui-mo requested a review from majetideepak as a code owner November 20, 2024 05:45
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 20, 2024
@rui-mo rui-mo changed the title Enable Parquet E2E filter tests on decimal type fix: Enable Parquet E2E filter tests on decimal type Nov 20, 2024
Copy link

netlify bot commented Nov 20, 2024

Deploy Preview for meta-velox canceled.

Name Link
🔨 Latest commit bcdb766
🔍 Latest deploy log https://app.netlify.com/sites/meta-velox/deploys/673da6ad7a6e0200084d7059

@rui-mo rui-mo force-pushed the wip_part_decimal branch 2 times, most recently from 0ff89b3 to bcdb766 Compare November 20, 2024 09:06
Copy link
Collaborator Author

@rui-mo rui-mo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Yuhta @majetideepak Would you like to take a review? Thanks.

@@ -316,6 +316,9 @@ bool testFilter(
return true;
}
if (type->isDecimal()) {
// The min and max value in the metadata for decimal type in Parquet can be
// stored in different physical types, including int32, int64 and
// fixed_len_byte_array. The loading of them is not supported in Metadata.
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I took a look at the support of row group skip for Parquet decimal and am glad to follow up in a separate PR if you think it is needed.

@Yuhta Yuhta added the ready-to-merge PR that have been reviewed and are ready for merging. PRs with this tag notify the Velox Meta oncall label Nov 20, 2024
@facebook-github-bot
Copy link
Contributor

@kagamiori has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

1 similar comment
@facebook-github-bot
Copy link
Contributor

@kagamiori has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@kagamiori merged this pull request in 1b75704.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Merged ready-to-merge PR that have been reviewed and are ready for merging. PRs with this tag notify the Velox Meta oncall
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants