Pull requests: vllm-project/vllm
Pull requests list
[Benchmark][Doc] Update throughput benchmark and README
#15998
opened Apr 3, 2025 by
StevenShi-23
[Bugfix][kernels] Fix half2float conversion in gguf kernels
ready
ONLY add when PR is ready to merge/full CI is needed
#15995
opened Apr 3, 2025 by
Isotr0py
[TPU] Switch Test to Non-Sliding Window
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
#15981
opened Apr 3, 2025 by
robertgshaw2-redhat
[V0][V1][Core] Add outlines integration for V1, and update V0 integration.
v1
#15975
opened Apr 3, 2025 by
unaidedelf8777
Draft
[BUGFIX] resolve bug preventing lm_eval working with vllm v1 engine
v1
#15971
opened Apr 2, 2025 by
brian-dellabetta
Revert "Add minimum version for huggingface_hub to enable Xet downloads"
ci/build
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#15963
opened Apr 2, 2025 by
robertgshaw2-redhat
Add support to modelopt quantization of Mixtral model
#15961
opened Apr 2, 2025 by
yueshen2016
[Core][P/D Disagg] KV Connector API for v1 disaggregated prefill support
ci/build
documentation
Improvements or additions to documentation
v1
#15960
opened Apr 2, 2025 by
ApostaC
[SupportsQuant] Chameleon, Chatglm, Commandr
ci/build
documentation
Improvements or additions to documentation
frontend
structured-output
v1
#15952
opened Apr 2, 2025 by
kylesayrs
[Frontend] Support guidance:no-additional-properties for compatibility with xgrammar
structured-output
#15949
opened Apr 2, 2025 by
tjohnson31415
[Bugfix] fix use_atomic_add support of marlin kernel when using v1 engine
#15946
opened Apr 2, 2025 by
jinzhen-lin
[Hardware][Gaudi][BugFix] fix arguments of hpu fused moe
#15945
opened Apr 2, 2025 by
zhenwei-intel
[Model] use AutoWeightsLoader for baichuan, gpt-neox, mpt
#15939
opened Apr 2, 2025 by
jonghyunchoe
[Bugfix] Enforce Ray backend for TPU/HPU as documented.
#15937
opened Apr 2, 2025 by
gitover22
[WIP][Kernel] Enable FP16 and BF16 CUTLASS MoE kernels
#15932
opened Apr 2, 2025 by
ElizaWszola
Draft
Fixed Stream set to True, client stream receiving arguments, concatenated json string, missing curly braces end
frontend
#15930
opened Apr 2, 2025 by
xiaodizi
[doc] update contribution link
documentation
Improvements or additions to documentation
#15922
opened Apr 2, 2025 by
reidliu41