Commit

vllm docs update
董晓龙 committed Sep 22, 2023
1 parent c147858 commit 5baeb38
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions docs/vllm_integration.md
@@ -18,3 +18,8 @@ See the supported models [here](https://vllm.readthedocs.io/en/latest/models/sup
```
python3 -m fastchat.serve.vllm_worker --model-path lmsys/vicuna-7b-v1.3 --tokenizer hf-internal-testing/llama-tokenizer
```
If you use an AWQ-quantized model, pass `--quantization awq`:
```
python3 -m fastchat.serve.vllm_worker --model-path TheBloke/vicuna-7B-v1.5-AWQ --quantization awq
```
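
The two worker commands in this diff differ only in their optional flags (`--tokenizer` for the first, `--quantization` for the AWQ one). As a rough sketch, a launch script could assemble either command from the same helper. The function name `build_vllm_worker_command` and the variable `awq_cmd` are hypothetical and not part of FastChat; only the module name and the flags shown in the diff are taken from the source.

```python
import shlex


def build_vllm_worker_command(model_path, quantization=None, tokenizer=None):
    """Return the argv list for launching the FastChat vLLM worker.

    Hypothetical helper for illustration; it only assembles the flags
    that appear in the documented commands and does not start a process.
    """
    argv = ["python3", "-m", "fastchat.serve.vllm_worker", "--model-path", model_path]
    if tokenizer is not None:
        # Optional tokenizer override, as in the vicuna-7b-v1.3 example.
        argv += ["--tokenizer", tokenizer]
    if quantization is not None:
        # For AWQ checkpoints the documented value is "awq".
        argv += ["--quantization", quantization]
    return argv


awq_cmd = build_vllm_worker_command("TheBloke/vicuna-7B-v1.5-AWQ", quantization="awq")
# Print a copy-pasteable shell command line.
print(shlex.join(awq_cmd))
```

To actually start the worker, the list could be handed to `subprocess.run(awq_cmd, check=True)`, assuming FastChat and vLLM are installed in the active environment.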
