Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[CI] Fix Distributed Tests ci/build
#18822 opened May 28, 2025 by NickLucche Loading…
Fix tpu model runner testcase failure tpu Related to Google TPUs v1
#18810 opened May 28, 2025 by CAROLZXYZXY Draft
Respect passed in device overrides in engine args
#18808 opened May 28, 2025 by Adolfo-Karim Loading…
[Frontend] add run batch to CLI documentation Improvements or additions to documentation frontend
#18804 opened May 28, 2025 by reidliu41 Loading…
[Misc] add more info make_client() func v1
#18803 opened May 28, 2025 by googs1025 Loading…
[Deprecation] Disallow pos-args other than model when initializing LLM frontend ready ONLY add when PR is ready to merge/full CI is needed
#18802 opened May 28, 2025 by DarkLight1337 Loading…
[Deprecation] Remove prompt_token_ids arg fallback in LLM.generate and LLM.embed documentation Improvements or additions to documentation frontend ready ONLY add when PR is ready to merge/full CI is needed structured-output v1
#18800 opened May 28, 2025 by DarkLight1337 Loading…
[Deprecation] Remove inputs arg fallback in Engine classes ready ONLY add when PR is ready to merge/full CI is needed
#18799 opened May 28, 2025 by DarkLight1337 Loading…
[Model] Add support for normalized Transformer (nGPT) from NVIDIA documentation Improvements or additions to documentation
#18798 opened May 28, 2025 by shan18 Loading…
decrement server_load on listen for disconnect frontend ready ONLY add when PR is ready to merge/full CI is needed
#18784 opened May 28, 2025 by daniel-salib Loading…
[Docs] Add developer doc about CI failures documentation Improvements or additions to documentation
#18782 opened May 27, 2025 by russellb Loading…
Fail request if FSM fails to advance v1
#18780 opened May 27, 2025 by atbe Loading…
[Perf] Tunings for SM100 FP8 CUTLASS kernel
#18778 opened May 27, 2025 by mgoin Loading…
Export NaNs in logits to scheduler_stats if output is corrupted tpu Related to Google TPUs v1
#18777 opened May 27, 2025 by vladmihailescu Loading…
[Bugfix] Fix for issue 17396
#18773 opened May 27, 2025 by frreiss Loading…
[Torch Nightly]add missing dependency ci/build
#18770 opened May 27, 2025 by yangw-dev Loading…
[WIP] Add a metric to track request failures documentation Improvements or additions to documentation frontend
#18765 opened May 27, 2025 by harche Draft
[Platform][Dist] Make torch distributed process group extendable ready ONLY add when PR is ready to merge/full CI is needed
#18763 opened May 27, 2025 by MengqingCao Loading…
ProTip! Filter pull requests by the default branch with base:main.