Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support invalid partToExtract for parse_url #11661

Merged
merged 1 commit into from
Oct 25, 2024

Conversation

thirtiseven
Copy link
Collaborator

@thirtiseven thirtiseven commented Oct 25, 2024

Closes #11659

In parse_url we used to fallback to cpu if the partToExtract was not valid. However, the behaviour of cpu is just to always return null, we can also do that easily.

But in customer case #11659, it did run on gpu because GpuOverrides thought lowercase path was valid, which is not, so it went to a defensive branch which threw an exception.

This pr supports invalid partToExtract in parse_url running on gpu, and fixes the bug in its override.

@thirtiseven thirtiseven self-assigned this Oct 25, 2024
@thirtiseven
Copy link
Collaborator Author

build

@thirtiseven thirtiseven merged commit e31a710 into NVIDIA:branch-24.12 Oct 25, 2024
48 checks passed
@thirtiseven thirtiseven deleted the parse_url_bugfix branch October 25, 2024 14:43
@sameerz sameerz added the bug Something isn't working label Oct 28, 2024
liurenjie1024 pushed a commit to liurenjie1024/spark-rapids that referenced this pull request Oct 31, 2024
liurenjie1024 pushed a commit to liurenjie1024/spark-rapids that referenced this pull request Oct 31, 2024
Support invalid partToExtract for parse_url (NVIDIA#11661)

See merge request nvspark/bd-spark-rapids!53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] parse_url throws exception if partToExtract is invalid while Spark returns null
3 participants