Skip to content

Commit

Permalink
update setting value arguments for vllm deployment
Browse files Browse the repository at this point in the history
Signed-off-by: Maxime Fournioux <[email protected]>
  • Loading branch information
mfournioux committed Dec 3, 2024
1 parent 3509d12 commit fbc3dc9
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion .github/workflows/lint-test.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -64,7 +64,7 @@ jobs:
run: |
export AWS_ACCESS_KEY_ID=minioadmin
export AWS_SECRET_ACCESS_KEY=minioadmin
helm install --wait --wait-for-jobs --timeout 5m0s --debug --create-namespace --namespace=ns-vllm test-vllm examples/chart-helm -f examples/chart-helm/values.yaml --set secrets.s3endpoint=http://minio:9000 --set secrets.s3bucketname=testbucket --set secrets.s3accesskeyid=$AWS_ACCESS_KEY_ID --set secrets.s3accesskey=$AWS_SECRET_ACCESS_KEY --set resources.requests.cpu=1 --set resources.requests.memory=4Gi --set resources.limits.cpu=2 --set resources.limits.memory=5Gi --set image.env[0].name=VLLM_CPU_KVCACHE_SPACE --set image.env[1].name=DTYPE --set image.env[2].name=VLLM_RPC_TIMEOUT --set image.env[3].name=VLLM_LOGGING_LEVEL --set image.env[4].name=ONEDNN_VERBOSE --set-string image.env[0].value="1" --set image.env[1].value=bfloat16 --set image.env[2].value=100000 --set-string image.env[3].value="DEBUG" --set image.env[4].value=all --set-string extraInit.s3modelpath="opt-125m/" --set-string 'resources.limits.nvidia\.com/gpu=0' --set-string 'resources.requests.nvidia\.com/gpu=0' --set-string image.repository="vllm-cpu-env"
helm install --wait --wait-for-jobs --timeout 5m0s --debug --create-namespace --namespace=ns-vllm test-vllm examples/chart-helm -f examples/chart-helm/values.yaml --set secrets.s3endpoint=http://minio:9000 --set secrets.s3bucketname=testbucket --set secrets.s3accesskeyid=$AWS_ACCESS_KEY_ID --set secrets.s3accesskey=$AWS_SECRET_ACCESS_KEY --set resources.requests.cpu=1 --set resources.requests.memory=4Gi --set resources.limits.cpu=2 --set resources.limits.memory=5Gi --set image.env[0].name=VLLM_CPU_KVCACHE_SPACE --set image.env[1].name=DTYPE --set image.env[2].name=VLLM_RPC_TIMEOUT --set image.env[3].name=VLLM_LOGGING_LEVEL --set image.env[4].name=ONEDNN_VERBOSE --set-string image.env[0].value="1" --set image.env[1].value=bfloat16 --set-string image.env[2].value="100000" --set-string image.env[3].value="DEBUG" --set image.env[4].value=all --set-string extraInit.s3modelpath="opt-125m/" --set-string 'resources.limits.nvidia\.com/gpu=0' --set-string 'resources.requests.nvidia\.com/gpu=0' --set-string image.repository="vllm-cpu-env"
- name: curl test
run: |

Check failure on line 70 in .github/workflows/lint-test.yaml

View workflow job for this annotation

GitHub Actions / actionlint

shellcheck reported issue in this script: SC2046:warning:3:29: Quote this to prevent word splitting
Expand Down

0 comments on commit fbc3dc9

Please sign in to comment.