Skip to content

Actions: vllm-project/vllm

Add label on auto-merge enabled

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
622 workflow run results
622 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Bugfix][Frontend] remove duplicate init logger
Add label on auto-merge enabled #47: Pull request #6581 auto_merge_enabled by DarkLight1337
July 19, 2024 14:19 13s
July 19, 2024 14:19 13s
[BUGFIX] Raise an error for no draft token case when draft_tp>1
Add label on auto-merge enabled #46: Pull request #6369 auto_merge_enabled by cadedaniel
July 19, 2024 08:21 9s
July 19, 2024 08:21 9s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #45: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 05:06 11s
July 19, 2024 05:06 11s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #44: Pull request #6557 auto_merge_enabled by comaniac
July 19, 2024 04:57 12s
July 19, 2024 04:57 12s
[Bugfix][Frontend] Fix missing /metrics endpoint
Add label on auto-merge enabled #43: Pull request #6463 auto_merge_enabled by simon-mo
July 19, 2024 02:00 11s
July 19, 2024 02:00 11s
Add support for a rope extension method
Add label on auto-merge enabled #42: Pull request #6553 auto_merge_enabled by simon-mo
July 19, 2024 01:16 1m 17s
July 19, 2024 01:16 1m 17s
[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm
Add label on auto-merge enabled #41: Pull request #6552 auto_merge_enabled by robertgshaw2-neuralmagic
July 18, 2024 22:00 13s
July 18, 2024 22:00 13s
[ci][test] add correctness test for cpu offloading
Add label on auto-merge enabled #40: Pull request #6549 auto_merge_enabled by youkaichao
July 18, 2024 21:49 12s
July 18, 2024 21:49 12s
[Misc] Small perf improvements
Add label on auto-merge enabled #39: Pull request #6520 auto_merge_enabled by Yard1
July 18, 2024 20:13 15s
July 18, 2024 20:13 15s
[Model] Support Mistral-Nemo
Add label on auto-merge enabled #38: Pull request #6548 auto_merge_enabled by mgoin
July 18, 2024 19:35 15s
July 18, 2024 19:35 15s
[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash
Add label on auto-merge enabled #37: Pull request #6501 auto_merge_enabled by comaniac
July 18, 2024 06:10 12s
July 18, 2024 06:10 12s
[Misc] Minor patch for draft model runner
Add label on auto-merge enabled #36: Pull request #6523 auto_merge_enabled by cadedaniel
July 18, 2024 05:39 10s
July 18, 2024 05:39 10s
[BugFix] Avoid secondary error in ShmRingBuffer destructor
Add label on auto-merge enabled #35: Pull request #6530 auto_merge_enabled by youkaichao
July 18, 2024 04:04 15s
July 18, 2024 04:04 15s
[BugFix][Frontend] Use LoRA tokenizer in OpenAI APIs
Add label on auto-merge enabled #34: Pull request #6227 auto_merge_enabled by DarkLight1337
July 18, 2024 00:55 14s
July 18, 2024 00:55 14s
[ Kernel ] FP8 Dynamic-Per-Token Quant Kernel
Add label on auto-merge enabled #33: Pull request #6511 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 23:08 11s
July 17, 2024 23:08 11s
[ Misc ] Improve Min Capability Checking in compressed-tensors
Add label on auto-merge enabled #32: Pull request #6522 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 22:05 11s
July 17, 2024 22:05 11s
[Core] Introduce SPMD worker execution using Ray accelerated DAG
Add label on auto-merge enabled #31: Pull request #6032 auto_merge_enabled by rkooo567
July 17, 2024 21:55 11s
July 17, 2024 21:55 11s
[Misc] Small perf improvements
Add label on auto-merge enabled #30: Pull request #6520 auto_merge_enabled by Yard1
July 17, 2024 21:52 11s
July 17, 2024 21:52 11s
[ Kernel ] Fp8 Channelwise Weight Support
Add label on auto-merge enabled #29: Pull request #6487 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 19:05 11s
July 17, 2024 19:05 11s
[Misc] Use torch.Tensor for type annotation
Add label on auto-merge enabled #28: Pull request #6505 auto_merge_enabled by DarkLight1337
July 17, 2024 12:51 32s
July 17, 2024 12:51 32s
[Bugfix] Fix for multinode crash on 4 PP
Add label on auto-merge enabled #27: Pull request #6495 auto_merge_enabled by youkaichao
July 17, 2024 07:54 11s
July 17, 2024 07:54 11s
[Misc][Speculative decoding] Typos and typing fixes
Add label on auto-merge enabled #26: Pull request #6467 auto_merge_enabled by DarkLight1337
July 17, 2024 03:54 11s
July 17, 2024 03:54 11s
[CI/Build] Update flashinfer to v0.0.9 (#6489)
Add label on auto-merge enabled #25: Pull request #6490 auto_merge_enabled by simon-mo
July 17, 2024 00:03 10s
July 17, 2024 00:03 10s
[Bugfix] Fix Ray Metrics API usage
Add label on auto-merge enabled #24: Pull request #6354 auto_merge_enabled by Yard1
July 16, 2024 23:51 12s
July 16, 2024 23:51 12s
[Misc] Log spec decode metrics
Add label on auto-merge enabled #23: Pull request #6454 auto_merge_enabled by comaniac
July 16, 2024 16:43 14s
July 16, 2024 16:43 14s
ProTip! You can narrow down the results and go further in time using created:<2024-07-16 or the other filters available.