Skip to content

Commit

Permalink
Revert image on benchmarks until verified
Browse files Browse the repository at this point in the history
Included a harmless TODO note so that we remember to come back to this
once the benchmark is verified with the latest Hugging Face DLC for TGI
cc @annapendleton
  • Loading branch information
alvarobartt committed Sep 17, 2024
1 parent f7462e6 commit c5a4669
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,7 @@ spec:
- name: text-generation-inference
ports:
- containerPort: 80
image: "us-docker.pkg.dev/deeplearning-platform-release/gcr.io/huggingface-text-generation-inference-cu121.2-2.ubuntu2204.py310"
image: "ghcr.io/huggingface/text-generation-inference:1.4.2" # TODO(annapendleton,alvarobartt): update to Hugging Face DLC once verified
args: ["--model-id", "${model_id}", "--num-shard", "${gpu_count}", "--max-concurrent-requests", "${max_concurrent_requests}"]
env:
%{ for hugging_face_token_secret in hugging_face_token_secret_list ~}
Expand Down

0 comments on commit c5a4669

Please sign in to comment.