Skip to content

Actions: vllm-project/vllm

Add label on auto-merge enabled

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
494 workflow run results
494 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Core] Allow specifying custom Executor
Add label on auto-merge enabled #44: Pull request #6557 auto_merge_enabled by comaniac
July 19, 2024 04:57 12s
July 19, 2024 04:57 12s
[Bugfix][Frontend] Fix missing /metrics endpoint
Add label on auto-merge enabled #43: Pull request #6463 auto_merge_enabled by simon-mo
July 19, 2024 02:00 11s
July 19, 2024 02:00 11s
Add support for a rope extension method
Add label on auto-merge enabled #42: Pull request #6553 auto_merge_enabled by simon-mo
July 19, 2024 01:16 1m 17s
July 19, 2024 01:16 1m 17s
[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm
Add label on auto-merge enabled #41: Pull request #6552 auto_merge_enabled by robertgshaw2-neuralmagic
July 18, 2024 22:00 13s
July 18, 2024 22:00 13s
[ci][test] add correctness test for cpu offloading
Add label on auto-merge enabled #40: Pull request #6549 auto_merge_enabled by youkaichao
July 18, 2024 21:49 12s
July 18, 2024 21:49 12s
[Misc] Small perf improvements
Add label on auto-merge enabled #39: Pull request #6520 auto_merge_enabled by Yard1
July 18, 2024 20:13 15s
July 18, 2024 20:13 15s
[Model] Support Mistral-Nemo
Add label on auto-merge enabled #38: Pull request #6548 auto_merge_enabled by mgoin
July 18, 2024 19:35 15s
July 18, 2024 19:35 15s
[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash
Add label on auto-merge enabled #37: Pull request #6501 auto_merge_enabled by comaniac
July 18, 2024 06:10 12s
July 18, 2024 06:10 12s
[Misc] Minor patch for draft model runner
Add label on auto-merge enabled #36: Pull request #6523 auto_merge_enabled by cadedaniel
July 18, 2024 05:39 10s
July 18, 2024 05:39 10s
[BugFix] Avoid secondary error in ShmRingBuffer destructor
Add label on auto-merge enabled #35: Pull request #6530 auto_merge_enabled by youkaichao
July 18, 2024 04:04 15s
July 18, 2024 04:04 15s
[BugFix][Frontend] Use LoRA tokenizer in OpenAI APIs
Add label on auto-merge enabled #34: Pull request #6227 auto_merge_enabled by DarkLight1337
July 18, 2024 00:55 14s
July 18, 2024 00:55 14s
[ Kernel ] FP8 Dynamic-Per-Token Quant Kernel
Add label on auto-merge enabled #33: Pull request #6511 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 23:08 11s
July 17, 2024 23:08 11s
[ Misc ] Improve Min Capability Checking in compressed-tensors
Add label on auto-merge enabled #32: Pull request #6522 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 22:05 11s
July 17, 2024 22:05 11s
[Core] Introduce SPMD worker execution using Ray accelerated DAG
Add label on auto-merge enabled #31: Pull request #6032 auto_merge_enabled by rkooo567
July 17, 2024 21:55 11s
July 17, 2024 21:55 11s
[Misc] Small perf improvements
Add label on auto-merge enabled #30: Pull request #6520 auto_merge_enabled by Yard1
July 17, 2024 21:52 11s
July 17, 2024 21:52 11s
[ Kernel ] Fp8 Channelwise Weight Support
Add label on auto-merge enabled #29: Pull request #6487 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 19:05 11s
July 17, 2024 19:05 11s
[Misc] Use torch.Tensor for type annotation
Add label on auto-merge enabled #28: Pull request #6505 auto_merge_enabled by DarkLight1337
July 17, 2024 12:51 32s
July 17, 2024 12:51 32s
[Bugfix] Fix for multinode crash on 4 PP
Add label on auto-merge enabled #27: Pull request #6495 auto_merge_enabled by youkaichao
July 17, 2024 07:54 11s
July 17, 2024 07:54 11s
[Misc][Speculative decoding] Typos and typing fixes
Add label on auto-merge enabled #26: Pull request #6467 auto_merge_enabled by DarkLight1337
July 17, 2024 03:54 11s
July 17, 2024 03:54 11s
[CI/Build] Update flashinfer to v0.0.9 (#6489)
Add label on auto-merge enabled #25: Pull request #6490 auto_merge_enabled by simon-mo
July 17, 2024 00:03 10s
July 17, 2024 00:03 10s
[Bugfix] Fix Ray Metrics API usage
Add label on auto-merge enabled #24: Pull request #6354 auto_merge_enabled by Yard1
July 16, 2024 23:51 12s
July 16, 2024 23:51 12s
[Misc] Log spec decode metrics
Add label on auto-merge enabled #23: Pull request #6454 auto_merge_enabled by comaniac
July 16, 2024 16:43 14s
July 16, 2024 16:43 14s
[Doc][CI/Build] Update docs and tests to use vllm serve
Add label on auto-merge enabled #22: Pull request #6431 auto_merge_enabled by simon-mo
July 16, 2024 15:33 12s
July 16, 2024 15:33 12s
[Misc] Fix typos in spec. decode metrics logging.
Add label on auto-merge enabled #21: Pull request #6470 auto_merge_enabled by DarkLight1337
July 16, 2024 11:43 14s
July 16, 2024 11:43 14s
[Core] Use numpy to speed up padded token processing
Add label on auto-merge enabled #20: Pull request #6442 auto_merge_enabled by DarkLight1337
July 16, 2024 03:08 11s
July 16, 2024 03:08 11s
ProTip! You can narrow down the results and go further in time using created:<2024-07-16 or the other filters available.