-
Notifications
You must be signed in to change notification settings - Fork 175
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
spark connect fails when performing a .show()
#3498
Comments
also note Can we just use |
no, the logical plan display is equivalent to
|
@andrewgazelka I can take on this one. |
most of this is ported from the python impl inside `daft/runners/partitioning.py`. ### Note for reviewer. For context around why this is needed. The `DataFrame` class uses `PartitionSet` extensively for various common operations such as `show`, and `collect`. In order to add this functionality to our spark connect implementation, we need a similar construct in rust. Ideally, I'd like to port over the python implementation to use this new rust one, but there are still a few things that I'm not entirely sure how to implement (such as `RayPartitionSet`) Not all of the methods inside `partitioning.rs` are used yet, But I intend to follow up this PR with an implementation for #3498, and this is a prerequisite as `show` relies on `get_preview_micropartitions`.
Describe the bug
results in
"Error in Daft server: Unsupported relation type: ShowString"
To Reproduce
No response
Expected behavior
No response
Component(s)
Other
Additional context
It appears that currently our
show
logic exists purely in python. As a prerequisite, we'll need to refactor that logic into rust so that it can be used from spark connect.The text was updated successfully, but these errors were encountered: