Skip to content

Actions: vllm-project/vllm

Add label on auto-merge enabled

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
603 workflow run results
603 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[ Kernel ] FP8 Dynamic Per Token Quant - Add scale_ub
Add label on auto-merge enabled #53: Pull request #6593 auto_merge_enabled by mgoin
July 20, 2024 00:35 12s
July 20, 2024 00:35 12s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #52: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 23:34 14s
July 19, 2024 23:34 14s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #51: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 22:27 10s
July 19, 2024 22:27 10s
[Bugfix] [SpecDecode] AsyncMetricsCollector: update time since last collection
Add label on auto-merge enabled #50: Pull request #6578 auto_merge_enabled by cadedaniel
July 19, 2024 20:53 11s
July 19, 2024 20:53 11s
[ Kernel ] Enable Dynamic Per Token fp8
Add label on auto-merge enabled #49: Pull request #6547 auto_merge_enabled by robertgshaw2-neuralmagic
July 19, 2024 18:34 13s
July 19, 2024 18:34 13s
[Misc] Fix input_scale typing in w8a8_utils.py
Add label on auto-merge enabled #48: Pull request #6579 auto_merge_enabled by mgoin
July 19, 2024 14:31 11s
July 19, 2024 14:31 11s
[Bugfix][Frontend] remove duplicate init logger
Add label on auto-merge enabled #47: Pull request #6581 auto_merge_enabled by DarkLight1337
July 19, 2024 14:19 13s
July 19, 2024 14:19 13s
[BUGFIX] Raise an error for no draft token case when draft_tp>1
Add label on auto-merge enabled #46: Pull request #6369 auto_merge_enabled by cadedaniel
July 19, 2024 08:21 9s
July 19, 2024 08:21 9s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #45: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 05:06 11s
July 19, 2024 05:06 11s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #44: Pull request #6557 auto_merge_enabled by comaniac
July 19, 2024 04:57 12s
July 19, 2024 04:57 12s
[Bugfix][Frontend] Fix missing /metrics endpoint
Add label on auto-merge enabled #43: Pull request #6463 auto_merge_enabled by simon-mo
July 19, 2024 02:00 11s
July 19, 2024 02:00 11s
Add support for a rope extension method
Add label on auto-merge enabled #42: Pull request #6553 auto_merge_enabled by simon-mo
July 19, 2024 01:16 1m 17s
July 19, 2024 01:16 1m 17s
[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm
Add label on auto-merge enabled #41: Pull request #6552 auto_merge_enabled by robertgshaw2-neuralmagic
July 18, 2024 22:00 13s
July 18, 2024 22:00 13s
[ci][test] add correctness test for cpu offloading
Add label on auto-merge enabled #40: Pull request #6549 auto_merge_enabled by youkaichao
July 18, 2024 21:49 12s
July 18, 2024 21:49 12s
[Misc] Small perf improvements
Add label on auto-merge enabled #39: Pull request #6520 auto_merge_enabled by Yard1
July 18, 2024 20:13 15s
July 18, 2024 20:13 15s
[Model] Support Mistral-Nemo
Add label on auto-merge enabled #38: Pull request #6548 auto_merge_enabled by mgoin
July 18, 2024 19:35 15s
July 18, 2024 19:35 15s
[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash
Add label on auto-merge enabled #37: Pull request #6501 auto_merge_enabled by comaniac
July 18, 2024 06:10 12s
July 18, 2024 06:10 12s
[Misc] Minor patch for draft model runner
Add label on auto-merge enabled #36: Pull request #6523 auto_merge_enabled by cadedaniel
July 18, 2024 05:39 10s
July 18, 2024 05:39 10s
[BugFix] Avoid secondary error in ShmRingBuffer destructor
Add label on auto-merge enabled #35: Pull request #6530 auto_merge_enabled by youkaichao
July 18, 2024 04:04 15s
July 18, 2024 04:04 15s
[BugFix][Frontend] Use LoRA tokenizer in OpenAI APIs
Add label on auto-merge enabled #34: Pull request #6227 auto_merge_enabled by DarkLight1337
July 18, 2024 00:55 14s
July 18, 2024 00:55 14s
[ Kernel ] FP8 Dynamic-Per-Token Quant Kernel
Add label on auto-merge enabled #33: Pull request #6511 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 23:08 11s
July 17, 2024 23:08 11s
[ Misc ] Improve Min Capability Checking in compressed-tensors
Add label on auto-merge enabled #32: Pull request #6522 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 22:05 11s
July 17, 2024 22:05 11s
[Core] Introduce SPMD worker execution using Ray accelerated DAG
Add label on auto-merge enabled #31: Pull request #6032 auto_merge_enabled by rkooo567
July 17, 2024 21:55 11s
July 17, 2024 21:55 11s
[Misc] Small perf improvements
Add label on auto-merge enabled #30: Pull request #6520 auto_merge_enabled by Yard1
July 17, 2024 21:52 11s
July 17, 2024 21:52 11s
[ Kernel ] Fp8 Channelwise Weight Support
Add label on auto-merge enabled #29: Pull request #6487 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 19:05 11s
July 17, 2024 19:05 11s
ProTip! You can narrow down the results and go further in time using created:<2024-07-17 or the other filters available.