Actions: vllm-project/vllm

Add label on auto-merge enabled

1,281 workflow runs

[Feature]: Add OpenAI server prompt_logprobs support #6508
Add label on auto-merge enabled #181: Pull request #7453 auto_merge_enabled by DarkLight1337
August 16, 2024 00:18 13s
[Feature]: Add OpenAI server prompt_logprobs support #6508
Add label on auto-merge enabled #180: Pull request #7453 auto_merge_enabled by DarkLight1337
August 16, 2024 00:04 19s
[Core] Fix tracking of model forward time to the span traces in case of PP>1
Add label on auto-merge enabled #179: Pull request #7440 auto_merge_enabled by rkooo567
August 15, 2024 20:43 13s
Chat method for offline llm
Add label on auto-merge enabled #178: Pull request #5049 auto_merge_enabled by DarkLight1337
August 15, 2024 16:21 16s
[Misc] Deprecation Warning when setting --engine-use-ray
Add label on auto-merge enabled #177: Pull request #7424 auto_merge_enabled by youkaichao
August 14, 2024 16:44 18s
[VLM][Core] Support profiling with multiple multi-modal inputs per prompt
Add label on auto-merge enabled #176: Pull request #7126 auto_merge_enabled by ywang96
August 14, 2024 16:36 13s
[Bugfix][Frontend] Disable embedding API for chat models
Add label on auto-merge enabled #175: Pull request #7504 auto_merge_enabled by DarkLight1337
August 14, 2024 13:58 12s
[Bugfix][Frontend] Disable embedding API for chat models
Add label on auto-merge enabled #174: Pull request #7504 auto_merge_enabled by DarkLight1337
August 14, 2024 06:26 11s
[Bugfix][Docs] Update list of mock imports
Add label on auto-merge enabled #173: Pull request #7493 auto_merge_enabled by DarkLight1337
August 14, 2024 03:12 12s
[CI/Build] Add text-only test for Qwen models
Add label on auto-merge enabled #172: Pull request #7475 auto_merge_enabled by DarkLight1337
August 14, 2024 01:38 10s
[Frontend][Core] Add plumbing to support audio language models
Add label on auto-merge enabled #171: Pull request #7446 auto_merge_enabled by ywang96
August 13, 2024 16:28 13s
[Frontend][Core] Add plumbing to support audio language models
Add label on auto-merge enabled #170: Pull request #7446 auto_merge_enabled by DarkLight1337
August 13, 2024 15:52 12s
Fix empty output when temp is too low
Add label on auto-merge enabled #169: Pull request #2937 auto_merge_enabled by DarkLight1337
August 13, 2024 14:01 15s
[Bugfix] Fix weight loading for Chameleon when TP>1
Add label on auto-merge enabled #168: Pull request #7410 auto_merge_enabled by DarkLight1337
August 13, 2024 03:38 11s
[Misc] improve logits processors logging message
Add label on auto-merge enabled #167: Pull request #7435 auto_merge_enabled by DarkLight1337
August 13, 2024 02:17 15s
[Core/Bugfix] Add FP8 K/V Scale and dtype conversion for prefix/prefill Triton Kernel
Add label on auto-merge enabled #166: Pull request #7208 auto_merge_enabled by comaniac
August 12, 2024 22:33 13s
[Frontend] Disallow passing model as both argument and option
Add label on auto-merge enabled #165: Pull request #7347 auto_merge_enabled by DarkLight1337
August 12, 2024 11:17 17s
[CI/Build] Minor refactoring for vLLM assets
Add label on auto-merge enabled #164: Pull request #7407 auto_merge_enabled by ywang96
August 12, 2024 01:08 13s
[Bugfix] Fix phi3v batch inference when images have different aspect ratio
Add label on auto-merge enabled #163: Pull request #7392 auto_merge_enabled by DarkLight1337
August 10, 2024 14:18 11s
[Misc] Add numpy implementation of compute_slot_mapping
Add label on auto-merge enabled #162: Pull request #7377 auto_merge_enabled by Yard1
August 9, 2024 22:35 14s
[Core] Fix edge case in chunked prefill + block manager v2
Add label on auto-merge enabled #161: Pull request #7380 auto_merge_enabled by cadedaniel
August 9, 2024 22:20 13s
[Bugfix] Fix PerTensorScaleParameter weight loading for fused models
Add label on auto-merge enabled #160: Pull request #7376 auto_merge_enabled by mgoin
August 9, 2024 20:06 17s
[Bugfix] Fix ITL recording in serving benchmark
Add label on auto-merge enabled #159: Pull request #7372 auto_merge_enabled by ywang96
August 9, 2024 16:59 11s
[Performance] e2e overheads reduction: Small followup diff
Add label on auto-merge enabled #158: Pull request #7364 auto_merge_enabled by comaniac
August 9, 2024 15:48 9s
[VLM][Doc] Add stop_token_ids to InternVL example
Add label on auto-merge enabled #157: Pull request #7354 auto_merge_enabled by DarkLight1337
August 9, 2024 14:27 10s