Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GLUTEN-8455][VL] Port encrypted file checks to shim layer #8501

Open
wants to merge 6 commits into
base: main
Choose a base branch
from

Conversation

ArnavBalyan
Copy link
Contributor

@ArnavBalyan ArnavBalyan commented Jan 10, 2025

  • Moved encrypted parquet file checks to shim, since parquet version changes across spark versions.
  • Added unit tests to cover the following cases:
    • No encrytpion.
    • Footer encrypted, and file encrypted
    • Footer plaintext, file encrypted
  • The behaviour continues to remain the same for any other case, and assume native scan is safe if we are unable to validate.
  • 3.5 can be checked in similar way, but provides better metadata for encryption checking, will add the support in next PR. Currently 3.5 retains the previous behaviour of always offloading.

@github-actions github-actions bot added CORE works for Gluten Core VELOX labels Jan 10, 2025
Copy link

#8455

Copy link

Run Gluten Clickhouse CI on x86

@ArnavBalyan
Copy link
Contributor Author

cc @Yohahaha, @jackylee-ch, can you please take a look. The exception checks also works for spark 3.5 but will use the footer metadata since it's more efficient

Copy link

Run Gluten Clickhouse CI on x86

Copy link

Run Gluten Clickhouse CI on x86

update
Copy link

Run Gluten Clickhouse CI on x86

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CORE works for Gluten Core VELOX
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant