map gke h100 megas to 'H100' (#3691)
* map gke h100 megas to 'H100'

* patch comment about H100 vs H100-mega

* format
asaiacai authored Jul 2, 2024
1 parent d40081a commit 47d3dc0
Showing 1 changed file with 7 additions and 2 deletions.
9 changes: 7 additions & 2 deletions sky/provision/kubernetes/utils.py
@@ -193,8 +193,13 @@ def get_accelerator_from_label_value(cls, value: str) -> str:
             return value.replace('nvidia-tesla-', '').upper()
         elif value.startswith('nvidia-'):
             acc = value.replace('nvidia-', '').upper()
-            if acc == 'H100-80GB':
-                # H100 is named as H100-80GB in GKE.
+            if acc in ['H100-80GB', 'H100-MEGA-80GB']:
+                # H100 is named H100-80GB or H100-MEGA-80GB in GKE,
+                # where the latter has improved bandwidth.
+                # See a3-mega instances on GCP.
+                # TODO: we do not distinguish the two GPUs for simplicity,
+                # but we can evaluate whether we should distinguish
+                # them based on users' requests.
                 return 'H100'
             return acc
         else:
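
For readers who want to check the effect of this change outside the repository, here is a minimal standalone sketch of the mapping after the patch. The function name `accelerator_from_gke_label` and the constant `_GKE_H100_LABELS` are hypothetical and are not part of `sky/provision/kubernetes/utils.py`. The final `else` branch is cut off in the diff above, so the `ValueError` raised here is only a placeholder assumption, not the project's actual behavior. The sample label values are assumed to be typical GKE accelerator label values.

```python
# Hypothetical standalone sketch of the label-to-accelerator mapping
# shown in the diff. Names below are illustrative, not the project's API.

# Both GKE label spellings that this commit collapses to 'H100'.
_GKE_H100_LABELS = ('H100-80GB', 'H100-MEGA-80GB')


def accelerator_from_gke_label(value: str) -> str:
    """Map a GKE accelerator label value to a canonical accelerator name."""
    if value.startswith('nvidia-tesla-'):
        return value.replace('nvidia-tesla-', '').upper()
    elif value.startswith('nvidia-'):
        acc = value.replace('nvidia-', '').upper()
        if acc in _GKE_H100_LABELS:
            # The two H100 variants are deliberately not distinguished.
            return 'H100'
        return acc
    else:
        # Assumption: the real else branch is elided in the diff, so this
        # sketch simply rejects labels it does not recognize.
        raise ValueError(f'Unrecognized accelerator label: {value!r}')


if __name__ == '__main__':
    for label in ('nvidia-tesla-t4', 'nvidia-l4', 'nvidia-h100-80gb',
                  'nvidia-h100-mega-80gb'):
        print(label, '->', accelerator_from_gke_label(label))
```

With this sketch, both `nvidia-h100-80gb` and `nvidia-h100-mega-80gb` resolve to the same name, which is the behavior the commit message describes. A caller that needs to tell a3-mega nodes apart would have to inspect the raw label value instead.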