-
Notifications
You must be signed in to change notification settings - Fork 671
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Failed to run rayjob in the sandbox #4026
Comments
I am interested! |
The issue is potentially here: https://github.com/flyteorg/flytekit/blob/master/flytekit/core/python_function_task.py#L101 Without a reload of the config, the task would be considered a python task and not a ray task |
Ok I have worked around the issue by locating the flyte pod and killing it. When the deployment restarts it, the pod will get the new config. The bug therefore is that ConfigMap changes are not watched for and reloaded by the demo cluster.
|
@ashahab Did Rayjob complete without errors? |
While running in the EKS cluster do you see the head and worker node created? Despite following what was instructed in the example, for me it always ran in local mode in the EKS cluster. |
For those who run into same issue, the job failed because I'm running sandbox on M2. Ray doesn't have muti-arch image, so you have to build an ARM image by yourself. |
Describe the bug
I can't run the Ray task in the sandbox, but it works if I run the same task in the EKS cluster.
Are you sure this issue hasn't been raised already?
Have you read the Code of Conduct?
The text was updated successfully, but these errors were encountered: