Actions: vllm-project/vllm

Add label on auto-merge enabled

1,281 workflow runs

[Misc] Refactor linear layer weight loading; introduce BasevLLMParameter and weight_loader_v2
Add label on auto-merge enabled #131: Pull request #5874 auto_merge_enabled by robertgshaw2-redhat
August 5, 2024 21:54 10s
[Bugfix][CI/Build] Fix CUTLASS FetchContent
Add label on auto-merge enabled #130: Pull request #7171 auto_merge_enabled by tlrmchlsmth
August 5, 2024 21:13 15s
[LoRA] Relax LoRA condition
Add label on auto-merge enabled #129: Pull request #7146 auto_merge_enabled by Yard1
August 5, 2024 18:36 14s
[Model] SiglipVisionModel ported from transformers
Add label on auto-merge enabled #128: Pull request #6942 auto_merge_enabled by ywang96
August 5, 2024 05:17 11s
[Speculative decoding] Add periodic log with time spent in proposal/scoring/verification
Add label on auto-merge enabled #127: Pull request #6963 auto_merge_enabled by cadedaniel
August 4, 2024 17:19 12s
[Model]Refactor MiniCPMV
Add label on auto-merge enabled #126: Pull request #7020 auto_merge_enabled by DarkLight1337
August 4, 2024 07:24 10s
Support for guided decoding for offline LLM
Add label on auto-merge enabled #125: Pull request #6878 auto_merge_enabled by DarkLight1337
August 4, 2024 01:24 12s
Support for guided decoding for offline LLM
Add label on auto-merge enabled #124: Pull request #6878 auto_merge_enabled by simon-mo
August 3, 2024 05:54 13s
[Frontend] Factor out chat message parsing
Add label on auto-merge enabled #123: Pull request #7055 auto_merge_enabled by DarkLight1337
August 3, 2024 02:36 12s
[Bugfix] Fix block table for seqs that have prefix cache hits
Add label on auto-merge enabled #122: Pull request #7018 auto_merge_enabled by comaniac
August 1, 2024 23:06 10s
[Build/CI] Fixing Docker Hub quota issue.
Add label on auto-merge enabled #121: Pull request #7043 auto_merge_enabled by simon-mo
August 1, 2024 18:07 14s
[Kernel] Fix input for flashinfer prefill wrapper.
Add label on auto-merge enabled #120: Pull request #7008 auto_merge_enabled by LiuXiaoxuanPKU
August 1, 2024 15:34 14s
Revert "[Frontend] Factor out code for running uvicorn"
Add label on auto-merge enabled #119: Pull request #7012 auto_merge_enabled by robertgshaw2-redhat
July 31, 2024 23:10 10s
[Bugfix] Clean up MiniCPM-V
Add label on auto-merge enabled #118: Pull request #6939 auto_merge_enabled by DarkLight1337
July 31, 2024 14:38 14s
[Bugfix] Set SamplingParams.max_tokens for OpenAI requests if not provided by user
Add label on auto-merge enabled #117: Pull request #6954 auto_merge_enabled by DarkLight1337
July 31, 2024 02:37 15s
[Bugfix] fix logit processor excceed vocab size issue
Add label on auto-merge enabled #116: Pull request #6927 auto_merge_enabled by DarkLight1337
July 31, 2024 02:36 10s
[Speculative decoding] Add serving benchmark for llama3 70b + speculative decoding
Add label on auto-merge enabled #115: Pull request #6964 auto_merge_enabled by cadedaniel
July 30, 2024 23:23 10s
[core][misc] improve free_finished_seq_groups
Add label on auto-merge enabled #114: Pull request #6865 auto_merge_enabled by WoosukKwon
July 30, 2024 20:53 10s
[Bugfix] Fix PaliGemma MMP
Add label on auto-merge enabled #113: Pull request #6930 auto_merge_enabled by DarkLight1337
July 30, 2024 07:53 11s
[Kernel] Fix marlin divide-by-zero warnings
Add label on auto-merge enabled #112: Pull request #6904 auto_merge_enabled by mgoin
July 29, 2024 18:49 11s
[Model] Initialize support for InternVL2 series models
Add label on auto-merge enabled #111: Pull request #6514 auto_merge_enabled by ywang96
July 29, 2024 08:49 14s
[Core] Reduce unnecessary compute when logprobs=None
Add label on auto-merge enabled #110: Pull request #6532 auto_merge_enabled by comaniac
July 29, 2024 05:17 12s
[Misc] Pass cutlass_fp8_supported correctly in fbgemm_fp8
Add label on auto-merge enabled #109: Pull request #6871 auto_merge_enabled by robertgshaw2-redhat
July 28, 2024 13:14 12s
Add Nemotron to PP_SUPPORTED_MODELS
Add label on auto-merge enabled #108: Pull request #6863 auto_merge_enabled by mgoin
July 27, 2024 21:52 13s
[Kernel] Remove scaled_fp8_quant kernel padding footgun
Add label on auto-merge enabled #107: Pull request #6842 auto_merge_enabled by DarkLight1337
July 27, 2024 10:27 10s