Skip to content

Commit

Permalink
Allow eager_prefill to be set in Helm chart (#557)
Browse files Browse the repository at this point in the history
  • Loading branch information
bdalal authored Jul 26, 2024
1 parent 2e81331 commit 15a38d5
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 0 deletions.
2 changes: 2 additions & 0 deletions charts/lorax/templates/deployment.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -45,6 +45,8 @@ spec:
- {{ .Values.deployment.args.maxBatchPrefillTokens | quote }}
- --sharded
- {{ .Values.deployment.args.sharded | quote }}
- --eager-prefill
- {{ .Values.deployment.args.eagerPrefill | quote }}
env:
- name: PORT
value: "8000"
Expand Down
1 change: 1 addition & 0 deletions charts/lorax/values.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,7 @@ deployment:
maxBatchTotalTokens: 4096
maxBatchPrefillTokens: 2048
sharded: false
eagerPrefill: false

env:
# Your huggingface hub token. Required for some models such as the llama-2 family.
Expand Down

0 comments on commit 15a38d5

Please sign in to comment.