You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
My simple dataframe merging fails with shuffle_method='p2p' if I set dask.config.set(scheduler=<scheduler_type>). Even if I set it to a distributed scheduler. Switching to any other shuffle_method succeeds.
ValueError: No worker found
The above exception was the direct cause of the following exception:
RuntimeError Traceback (most recent call last)
File /mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_core.py:517, in handle_transfer_errors(id)
[516](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_core.py:516) try:
--> [517](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_core.py:517) yield
[518](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_core.py:518) except ShuffleClosedError:
File /mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_shuffle.py:83, in shuffle_transfer(input, id, input_partition, npartitions, column, meta, parts_out, disk, drop_column)
[82](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_shuffle.py:82) with handle_transfer_errors(id):
---> [83](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_shuffle.py:83) return get_worker_plugin().add_partition(
[84](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_shuffle.py:84) input,
...
[523](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_core.py:523) raise
[524](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_core.py:524) except Exception as e:
--> [525](https://vscode-remote+ssh-002dremote-002bdev.vscode-resource.vscode-cdn.net/mnt/home/andoni.garcia/code/research/.venv/lib/python3.11/site-packages/distributed/shuffle/_core.py:525) raise RuntimeError(f"P2P shuffling {id} failed during transfer phase") from e
RuntimeError: P2P shuffling 89e4b2019b4ea76e5a3ba83c87881f3f failed during transfer phase
Anything else we need to know?:
This is reproducible in both python scripts and Jupyter environments and across several tested dask versions.
Environment:
Dask version: 2024.10.0
Python version: Python 3.11.9
Operating System: Ubuntu 22.04.5 LTS
Install method (conda, pip, source): poetry
The text was updated successfully, but these errors were encountered:
Describe the issue:
My simple dataframe merging fails with
shuffle_method='p2p'
if I setdask.config.set(scheduler=<scheduler_type>)
. Even if I set it to a distributed scheduler. Switching to any othershuffle_method
succeeds.Minimal Complete Verifiable Example:
Yields:
Anything else we need to know?:
This is reproducible in both python scripts and Jupyter environments and across several tested dask versions.
Environment:
2024.10.0
Python 3.11.9
Ubuntu 22.04.5 LTS
poetry
The text was updated successfully, but these errors were encountered: