From 8227ce9b196991acd96a0e62596f575b560fd564 Mon Sep 17 00:00:00 2001 From: Romil Bhardwaj Date: Mon, 16 Oct 2023 14:53:08 -0700 Subject: [PATCH] [Docs][k8s] Update GKE driver installation for L4 nodes (#2705) * Update L4 note * fix --- .../source/reference/kubernetes/kubernetes-setup.rst | 12 +++++++++--- 1 file changed, 9 insertions(+), 3 deletions(-) diff --git a/docs/source/reference/kubernetes/kubernetes-setup.rst b/docs/source/reference/kubernetes/kubernetes-setup.rst index c1f103a8571..975ebeba177 100644 --- a/docs/source/reference/kubernetes/kubernetes-setup.rst +++ b/docs/source/reference/kubernetes/kubernetes-setup.rst @@ -90,16 +90,22 @@ Deploying on Google Cloud GKE 3. [If using GPUs] If your GKE nodes have GPUs, you may need to to `manually install `_ nvidia drivers. You can do so by deploying the daemonset - depending on the OS of your nodes: + depending on the GPU and OS on your nodes: .. code-block:: console - # For Container Optimized OS (COS) based nodes: + # For Container Optimized OS (COS) based nodes with GPUs other than Nvidia L4 (e.g., V100, A100, ...): $ kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/cos/daemonset-preloaded.yaml - # For Ubuntu based nodes: + # For Container Optimized OS (COS) based nodes with L4 GPUs: + $ kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/cos/daemonset-preloaded-latest.yaml + + # For Ubuntu based nodes with GPUs other than Nvidia L4 (e.g., V100, A100, ...): $ kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/ubuntu/daemonset-preloaded.yaml + # For Ubuntu based nodes with L4 GPUs: + $ kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/ubuntu/daemonset-preloaded-R525.yaml + To verify if GPU drivers are set up, run ``kubectl describe nodes`` and verify that ``nvidia.com/gpu`` is listed under the ``Capacity`` section. 4. Verify your kubeconfig (and GPU support, if available) is correctly set up by running :code:`sky check`: