-
-
Notifications
You must be signed in to change notification settings - Fork 4.6k
Issues: vllm-project/vllm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Installation]: VLLM on ARM machine with GH200
installation
Installation problems
#10459
opened Nov 19, 2024 by
Phimanu
1 task done
[Bug]: Unable to configure formatter 'vllm'
bug
Something isn't working
#10457
opened Nov 19, 2024 by
pspdada
1 task done
[Bug] torch.distributed.DistBackendError: NCCL error in docker.io/vllm/vllm-openai:v0.6.4.post1
bug
Something isn't working
#10453
opened Nov 19, 2024 by
victorserbu2709
1 task done
[Bug]: Breaking Change in Something isn't working
gpu_memory_utilization
Behavior in vLLM 0.6.4
bug
#10451
opened Nov 19, 2024 by
movchan74
1 task done
[Bug]: Duplicate trace Id
bug
Something isn't working
#10447
opened Nov 19, 2024 by
LakshmiPriyaSujith
1 task done
[Feature]: LoRA fine-tuning model for DeepSeek V2
feature request
#10446
opened Nov 19, 2024 by
Z-Diviner
1 task done
[Bug]: RuntimeError: CUDA error: operation not permitted when stream is capturing when serving llama 3.2 90b
bug
Something isn't working
#10445
opened Nov 19, 2024 by
bingwork
1 task done
[Bug]: request reward model report 500 Internal Server Error
bug
Something isn't working
#10444
opened Nov 19, 2024 by
hrdxwandg
1 task done
[Bug]: Speculative decoding + guided decoding not working
bug
Something isn't working
#10442
opened Nov 19, 2024 by
arunpatala
1 task done
[Bug]: Input prompt (35247 tokens) is too long and exceeds limit of 1000
bug
Something isn't working
#10440
opened Nov 19, 2024 by
Crista23
[Bug]: vllm server crash when num-scheduler-steps > 1 and max_tokens=0
bug
Something isn't working
#10432
opened Nov 18, 2024 by
atanikan
1 task done
[Doc]: Pages were moved without a redirect
documentation
Improvements or additions to documentation
#10428
opened Nov 18, 2024 by
shannonxtreme
1 task done
[Doc]: Migrate to Markdown
documentation
Improvements or additions to documentation
#10427
opened Nov 18, 2024 by
rafvasq
1 task done
[Feature]: Add Support for Specifying Local CUTLASS Source Directory via Environment Variable
feature request
#10423
opened Nov 18, 2024 by
wchen61
1 task done
[Doc]: Compare LMDeploy vs vLLM AWQ Triton kernels
documentation
Improvements or additions to documentation
#10420
opened Nov 18, 2024 by
casper-hansen
1 task done
[Bug]: NCCL error with 2-way pipeline parallelism.
bug
Something isn't working
#10419
opened Nov 18, 2024 by
Pl4tiNuM
1 task done
[Bug]: KV Cache Quantization with GGUF turns out quite poorly.
bug
Something isn't working
#10411
opened Nov 18, 2024 by
phazei
1 task done
[Bug]: (Program crashes after increasing --tensor-parallel-size) with error pynvml.NVMLError_InvalidArgument: Invalid Argument
bug
Something isn't working
#10409
opened Nov 18, 2024 by
JohnConnor123
1 task done
[Bug]: 使用vllm和transformer部署Qwen2vl,同一张图片输出结果不一致
bug
Something isn't working
#10408
opened Nov 18, 2024 by
Apricot1225
1 task done
[New Model]: fishaudio/fish-speech-1.4
new model
Requests to new models
#10404
opened Nov 17, 2024 by
cavities
1 task done
[Bug]: v0.6.4.post1 crashed:Error in model execution: CUDA error: an illegal memory access was encountered
bug
Something isn't working
#10389
opened Nov 16, 2024 by
wciq1208
1 task done
[Misc]: Ask for the roadmap of async output processing support for speculative decoding
misc
#10387
opened Nov 16, 2024 by
Lin-Qingyang-Alec
1 task done
[Bug]: Granite 3.0 disconnect between parser and example template
bug
Something isn't working
#10379
opened Nov 15, 2024 by
wilbry
1 task done
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.