From adef0c0e859c9a6a26e40c21d313bd1bdb4cd67c Mon Sep 17 00:00:00 2001 From: kaushikmitr Date: Fri, 1 Mar 2024 23:38:23 +0000 Subject: [PATCH] fix readme --- benchmarks/inference-server/triton/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/benchmarks/inference-server/triton/README.md b/benchmarks/inference-server/triton/README.md index b7930c3e3..572274c06 100644 --- a/benchmarks/inference-server/triton/README.md +++ b/benchmarks/inference-server/triton/README.md @@ -188,7 +188,7 @@ terraform apply | `model_id` | Model used for inference. | String | `"meta-llama/Llama-2-7b-chat-hf"` | No | | `gpu_count` | Parallelism based on number of gpus. | Number | `1` | No | | `ksa` | Kubernetes Service Account used for workload. | String | `"default"` | No | -| `huggingface-secret` | Name of the kubectl huggingface secret token | String | `"huggingface-secret"` | No | +| `huggingface-secret` | Name of the kubectl huggingface secret token | String | `"huggingface-secret"` | Yes | | `templates_path` | Path where manifest templates will be read from. | String | | No | ## Notes