Actions: vllm-project/vllm

Add label on auto-merge enabled

622 workflow run results

[Bugfix] Fix block table for seqs that have prefix cache hits
Add label on auto-merge enabled #122: Pull request #7018 auto_merge_enabled by comaniac
August 1, 2024 23:06 10s
[Build/CI] Fixing Docker Hub quota issue.
Add label on auto-merge enabled #121: Pull request #7043 auto_merge_enabled by simon-mo
August 1, 2024 18:07 14s
[Kernel] Fix input for flashinfer prefill wrapper.
Add label on auto-merge enabled #120: Pull request #7008 auto_merge_enabled by LiuXiaoxuanPKU
August 1, 2024 15:34 14s
Revert "[Frontend] Factor out code for running uvicorn"
Add label on auto-merge enabled #119: Pull request #7012 auto_merge_enabled by robertgshaw2-neuralmagic
July 31, 2024 23:10 10s
[Bugfix] Clean up MiniCPM-V
Add label on auto-merge enabled #118: Pull request #6939 auto_merge_enabled by DarkLight1337
July 31, 2024 14:38 14s
[Bugfix] Set SamplingParams.max_tokens for OpenAI requests if not provided by user
Add label on auto-merge enabled #117: Pull request #6954 auto_merge_enabled by DarkLight1337
July 31, 2024 02:37 15s
[Bugfix] fix logit processor excceed vocab size issue
Add label on auto-merge enabled #116: Pull request #6927 auto_merge_enabled by DarkLight1337
July 31, 2024 02:36 10s
[Speculative decoding] Add serving benchmark for llama3 70b + speculative decoding
Add label on auto-merge enabled #115: Pull request #6964 auto_merge_enabled by cadedaniel
July 30, 2024 23:23 10s
[core][misc] improve free_finished_seq_groups
Add label on auto-merge enabled #114: Pull request #6865 auto_merge_enabled by WoosukKwon
July 30, 2024 20:53 10s
[Bugfix] Fix PaliGemma MMP
Add label on auto-merge enabled #113: Pull request #6930 auto_merge_enabled by DarkLight1337
July 30, 2024 07:53 11s
[Kernel] Fix marlin divide-by-zero warnings
Add label on auto-merge enabled #112: Pull request #6904 auto_merge_enabled by mgoin
July 29, 2024 18:49 11s
[Model] Initialize support for InternVL2 series models
Add label on auto-merge enabled #111: Pull request #6514 auto_merge_enabled by ywang96
July 29, 2024 08:49 14s
[Core] Reduce unnecessary compute when logprobs=None
Add label on auto-merge enabled #110: Pull request #6532 auto_merge_enabled by comaniac
July 29, 2024 05:17 12s
[Misc] Pass cutlass_fp8_supported correctly in fbgemm_fp8
Add label on auto-merge enabled #109: Pull request #6871 auto_merge_enabled by robertgshaw2-neuralmagic
July 28, 2024 13:14 12s
Add Nemotron to PP_SUPPORTED_MODELS
Add label on auto-merge enabled #108: Pull request #6863 auto_merge_enabled by mgoin
July 27, 2024 21:52 13s
[Kernel] Remove scaled_fp8_quant kernel padding footgun
Add label on auto-merge enabled #107: Pull request #6842 auto_merge_enabled by DarkLight1337
July 27, 2024 10:27 10s
[Model] Initial support for BLIP-2
Add label on auto-merge enabled #106: Pull request #5920 auto_merge_enabled by DarkLight1337
July 27, 2024 10:00 11s
[bugfix] make args.stream work
Add label on auto-merge enabled #105: Pull request #6831 auto_merge_enabled by DarkLight1337
July 27, 2024 09:04 1m 42s
[CI/Build][Doc] Update CI and Doc for VLM example changes
Add label on auto-merge enabled #104: Pull request #6860 auto_merge_enabled by ywang96
July 27, 2024 07:22 10s
[ROCm] Upgrade PyTorch nightly version
Add label on auto-merge enabled #103: Pull request #6845 auto_merge_enabled by WoosukKwon
July 27, 2024 03:15 11s
[Bug Fix] Illegal memory access, FP8 Llama 3.1 405b
Add label on auto-merge enabled #102: Pull request #6852 auto_merge_enabled by comaniac
July 27, 2024 02:18 9s
Update README.md
Add label on auto-merge enabled #101: Pull request #6847 auto_merge_enabled by zhuohan123
July 27, 2024 00:25 12s
[TPU] Support collective communications in XLA devices
Add label on auto-merge enabled #100: Pull request #6813 auto_merge_enabled by WoosukKwon
July 26, 2024 23:01 14s
enforce eager mode with bnb quantization temporarily
Add label on auto-merge enabled #99: Pull request #6846 auto_merge_enabled by mgoin
July 26, 2024 21:57 10s
[Bugfix][Kernel] Promote another index to int64_t
Add label on auto-merge enabled #98: Pull request #6838 auto_merge_enabled by WoosukKwon
July 26, 2024 17:43 13s