You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I had searched in the issues and found no similar feature requirement.
Description
I was chatting with Michael and Pradeep and I think this is one of the things that will make early stage ray poc easier. I know ray removed redis as default option some time ago, but for us, when successfully running ray for the first time and see ray being restarted in the middle of training was a great experience. we should probably consider including redis + fault tolerance as part of the helm chart and easier for users to enable?
Use case
I think it will be a great experience to have fault tolerance enabled by default (or at least having one option to enable it), especially in k8s environment.
Related issues
No response
Are you willing to submit a PR?
Yes I am willing to submit a PR!
The text was updated successfully, but these errors were encountered:
I think it will be a great experience to have fault tolerance enabled by default (or at least having one option to enable it), especially in k8s environment.
Search before asking
Description
I was chatting with Michael and Pradeep and I think this is one of the things that will make early stage ray poc easier. I know ray removed redis as default option some time ago, but for us, when successfully running ray for the first time and see ray being restarted in the middle of training was a great experience. we should probably consider including redis + fault tolerance as part of the helm chart and easier for users to enable?
Use case
I think it will be a great experience to have fault tolerance enabled by default (or at least having one option to enable it), especially in k8s environment.
Related issues
No response
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: