Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[k8s] sky local up speed up for GPUs #3664

Merged
merged 1 commit into from
Jun 18, 2024
Merged

Conversation

romilbhardwaj
Copy link
Collaborator

This PR speeds up sky local up on GPU machines to ~2 min 30s, down from ~12 min on current master.

We remove the GPU labelling step, which is no longer required after #3493 since we can use nvidia's official labels.

We also remove the image pre-pull steps, since kubernetes (kind) pulling the image directly during launch is faster than docker pull -> kind load, which takes ~2x the time.

Tested on T4 and L4 GCP instances with sky local up and sky launch --gpus T4/L4:1 --cloud kubernetes -- nvidia-smi.

Copy link
Collaborator

@Michaelvll Michaelvll left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Aweseom @romilbhardwaj! LGTM.

@romilbhardwaj romilbhardwaj merged commit 9b4a54c into master Jun 18, 2024
20 checks passed
@romilbhardwaj romilbhardwaj deleted the k8s_localup_gpu_speedup branch June 18, 2024 21:24
Michaelvll pushed a commit that referenced this pull request Aug 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants