Pull requests: vllm-project/vllm
Add DeepSeek-R1-0528 function call chat template
documentation
Improvements or additions to documentation
tool-calling
#18874
opened May 29, 2025 by
Xu-Wenqing
Draft
[doc] add CLI doc
documentation
Improvements or additions to documentation
#18871
opened May 29, 2025 by
reidliu41
[Model] NemotronH support
documentation
Improvements or additions to documentation
#18863
opened May 28, 2025 by
vegaluisjose
[Core] Cast multimodal input in hf processor
multi-modality
Related to multi-modality (#4194)
speculative-decoding
tpu
Related to Google TPUs
v1
#18862
opened May 28, 2025 by
lgeiger
[Bugfix] Remove NVFP4 scales assertions to fix load_format=dummy
#18861
opened May 28, 2025 by
mgoin
[Bugfix] Ensure tensors are contiguous during serialisation
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#18860
opened May 28, 2025 by
lgeiger
Fixes a dead link in nightly benchmark readme
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#18856
opened May 28, 2025 by
nerdalert
Fix an error in dummy weight loading for quantization models
#18855
opened May 28, 2025 by
Chenyaaang
[Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets
documentation
Improvements or additions to documentation
#18847
opened May 28, 2025 by
ekagra-ranjan
Use standalone_compile by default in torch >= 2.8.0
ready
ONLY add when PR is ready to merge/full CI is needed
#18846
opened May 28, 2025 by
zou3519
[Bugfix] Fix misleading information in the documentation
documentation
Improvements or additions to documentation
#18845
opened May 28, 2025 by
jeejeelee
[Perf] Tune scaled_fp8_quant by increasing vectorization
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#18844
opened May 28, 2025 by
mgoin
[LoRA] Add LoRA support for InternVL
ready
ONLY add when PR is ready to merge/full CI is needed
#18842
opened May 28, 2025 by
jeejeelee
[Misc] Split monolithic config.py into domain-specific modules
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
#18830
opened May 28, 2025 by
jeejeelee
[Bugfix] AssertionError when releasing waiting requests without KV cache allocation
v1
#18829
opened May 28, 2025 by
Abatom
[FEAT][ROCm] Add AITER grouped topk for DeepSeekV2
rocm
Related to AMD ROCm
#18825
opened May 28, 2025 by
vllmellm
[Misc] refactor: simplify EngineCoreClient.make_async_mp_client in AsyncLLM
v1
#18817
opened May 28, 2025 by
googs1025