Commit

fix readme
kaushikmitr committed Mar 1, 2024
1 parent 0304a77 commit adef0c0
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion benchmarks/inference-server/triton/README.md
@@ -188,7 +188,7 @@ terraform apply
| `model_id` | Model used for inference. | String | `"meta-llama/Llama-2-7b-chat-hf"` | No |
| `gpu_count` | Parallelism based on number of gpus. | Number | `1` | No |
| `ksa` | Kubernetes Service Account used for workload. | String | `"default"` | No |
- | `huggingface-secret` | Name of the kubectl huggingface secret token | String | `"huggingface-secret"` | No |
+ | `huggingface-secret` | Name of the kubectl huggingface secret token | String | `"huggingface-secret"` | Yes |
| `templates_path` | Path where manifest templates will be read from. | String | | No |

## Notes