Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Benchmark: add H100 suite perf-benchmarks
#6047 opened Jul 1, 2024 by simon-mo Loading…
Create guided_decoding.md
#6045 opened Jul 1, 2024 by w013nad Loading…
model test with cache
#6024 opened Jul 1, 2024 by khluu Draft
Test HF cache
#6016 opened Jul 1, 2024 by khluu Draft
[core]fix prompt limit
#6014 opened Jul 1, 2024 by jikunshang Loading…
[Frontend] Add bad_words_ids sampling parameter
#5986 opened Jun 29, 2024 by Alvant Loading…
[misc][doc] try to add warning for latest html
#5979 opened Jun 29, 2024 by youkaichao Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.