Artifacts for installing the Distributed Workloads stack as part of ODH
Distributed Workloads is a simple, user-friendly abstraction for scaling, queuing and resource management of distributed AI/ML and Python workloads. It consists of three components:
-
CodeFlare SDK to define and control remote distributed compute jobs and infrastructure with any Python based environment
-
Multi-Cluster Application Dispatcher (MCAD) for management of batch jobs
-
Instascale for on-demand scaling of a Kubernetes cluster
-
KubeRay for management of remote Ray clusters on Kubernetes for running distributed compute workloads
Integration of this stack into the Open Data Hub is owned by the Distributed Workloads Working Group. See this page for further details and how to get in touch.
Component | Version |
---|---|
CodeFlare Operator | v0.0.4 |
Multi-Cluster App Dispatcher | v1.31.0 |
CodeFlare-SDK | v0.4.4 |
InstaScale | v0.0.4 |
KubeRay | v0.5.0 |
Follow our quick start guide here to get up and running with Distributed Workflows on Open Data Hub.