Skip to content

Commit

Permalink
Merge remote-tracking branch 'origin/main' into hpa_custommetrics
Browse files Browse the repository at this point in the history
  • Loading branch information
rsgowman committed Mar 6, 2024
2 parents 5c6cd2d + 92055de commit 5769207
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@ spec:
- name: text-generation-inference
ports:
- containerPort: 80
image: "ghcr.io/huggingface/text-generation-inference:1.1.1"
image: "ghcr.io/huggingface/text-generation-inference:1.4.2"
args: ["--model-id", "${model_id}", "--num-shard", "${gpu_count}"] # , "{token}" tensor parallelism, should correspond to number of gpus below
%{ for hugging_face_token_secret in hugging_face_token_secret_list ~}
env:
Expand Down

0 comments on commit 5769207

Please sign in to comment.