Pull requests: vllm-project/vllm

Pull requests list

[Bugfix][kernels] Fix half2float conversion in gguf kernels
Labels: ready
#15995 opened Apr 3, 2025 by Isotr0py
Use custom address for listening socket
#15988 opened Apr 3, 2025 by jglaser
[TPU] Switch Test to Non-Sliding Window
Labels: ready, tpu
#15981 opened Apr 3, 2025 by robertgshaw2-redhat
[Misc] improve gguf check
#15974 opened Apr 3, 2025 by reidliu41
Revert "Add minimum version for huggingface_hub to enable Xet downloads"
Labels: ci/build, needs-rebase, ready
#15963 opened Apr 2, 2025 by robertgshaw2-redhat
[Core][P/D Disagg] KV Connector API for v1 disaggregated prefill support
Labels: ci/build, documentation, v1
#15960 opened Apr 2, 2025 by ApostaC
DeepGemm MoE expert map support
#15957 opened Apr 2, 2025 by bnellnm Draft
Modular fused experts
#15956 opened Apr 2, 2025 by bnellnm Draft
[SupportsQuant] Chameleon, Chatglm, Commandr
Labels: ci/build, documentation, frontend, structured-output, v1
#15952 opened Apr 2, 2025 by kylesayrs
[doc] update contribution link
Labels: documentation
#15922 opened Apr 2, 2025 by reidliu41