Skip to content

Actions: vllm-project/vllm

Add label on auto-merge enabled

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
603 workflow run results
603 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[ROCm] Upgrade PyTorch nightly version
Add label on auto-merge enabled #103: Pull request #6845 auto_merge_enabled by WoosukKwon
July 27, 2024 03:15 11s
July 27, 2024 03:15 11s
[Bug Fix] Illegal memory access, FP8 Llama 3.1 405b
Add label on auto-merge enabled #102: Pull request #6852 auto_merge_enabled by comaniac
July 27, 2024 02:18 9s
July 27, 2024 02:18 9s
Update README.md
Add label on auto-merge enabled #101: Pull request #6847 auto_merge_enabled by zhuohan123
July 27, 2024 00:25 12s
July 27, 2024 00:25 12s
[TPU] Support collective communications in XLA devices
Add label on auto-merge enabled #100: Pull request #6813 auto_merge_enabled by WoosukKwon
July 26, 2024 23:01 14s
July 26, 2024 23:01 14s
enforce eager mode with bnb quantization temporarily
Add label on auto-merge enabled #99: Pull request #6846 auto_merge_enabled by mgoin
July 26, 2024 21:57 10s
July 26, 2024 21:57 10s
[Bugfix][Kernel] Promote another index to int64_t
Add label on auto-merge enabled #98: Pull request #6838 auto_merge_enabled by WoosukKwon
July 26, 2024 17:43 13s
July 26, 2024 17:43 13s
[Misc][TPU] Support TPU in initialize_ray_cluster
Add label on auto-merge enabled #97: Pull request #6812 auto_merge_enabled by WoosukKwon
July 26, 2024 17:28 11s
July 26, 2024 17:28 11s
[Doc] Add missing mock import to docs conf.py
Add label on auto-merge enabled #96: Pull request #6834 auto_merge_enabled by DarkLight1337
July 26, 2024 15:57 11s
July 26, 2024 15:57 11s
[Doc] Add missing mock import to docs conf.py
Add label on auto-merge enabled #95: Pull request #6834 auto_merge_enabled by DarkLight1337
July 26, 2024 13:29 16s
July 26, 2024 13:29 16s
[doc][debugging] add known issues for hangs
Add label on auto-merge enabled #94: Pull request #6816 auto_merge_enabled by DarkLight1337
July 26, 2024 04:45 9s
July 26, 2024 04:45 9s
[Frontend] New allowed_token_ids decoding request parameter
Add label on auto-merge enabled #93: Pull request #6753 auto_merge_enabled by DarkLight1337
July 26, 2024 02:26 11s
July 26, 2024 02:26 11s
[Docs] Publish 5th meetup slides
Add label on auto-merge enabled #92: Pull request #6799 auto_merge_enabled by zhuohan123
July 25, 2024 23:46 12s
July 25, 2024 23:46 12s
[Bugfix] Fix empty (nullptr) channelwise scales when loading wNa16 using compressed tensors
Add label on auto-merge enabled #91: Pull request #6798 auto_merge_enabled by mgoin
July 25, 2024 21:28 13s
July 25, 2024 21:28 13s
Fix ReplicatedLinear weight loading
Add label on auto-merge enabled #90: Pull request #6793 auto_merge_enabled by comaniac
July 25, 2024 20:08 11s
July 25, 2024 20:08 11s
[Core] Fix ray forward_dag error mssg
Add label on auto-merge enabled #89: Pull request #6792 auto_merge_enabled by simon-mo
July 25, 2024 17:04 12s
July 25, 2024 17:04 12s
[Bugfix] Add image placeholder for OpenAI Compatible Server of MiniCPM-V
Add label on auto-merge enabled #88: Pull request #6787 auto_merge_enabled by DarkLight1337
July 25, 2024 16:30 17s
July 25, 2024 16:30 17s
[Bugfix] Fix kv_cache_dtype=fp8 without scales for FP8 checkpoints
Add label on auto-merge enabled #87: Pull request #6761 auto_merge_enabled by mgoin
July 25, 2024 14:51 12s
July 25, 2024 14:51 12s
[Bugfix] Fix encoding_format in examples/openai_embedding_client.py
Add label on auto-merge enabled #86: Pull request #6755 auto_merge_enabled by DarkLight1337
July 25, 2024 02:21 11s
July 25, 2024 02:21 11s
[ Misc ] fp8-marlin channelwise via compressed-tensors
Add label on auto-merge enabled #85: Pull request #6524 auto_merge_enabled by mgoin
July 25, 2024 00:45 11s
July 25, 2024 00:45 11s
[Bugfix] Fix decode tokens w. CUDA graph
Add label on auto-merge enabled #84: Pull request #6757 auto_merge_enabled by comaniac
July 24, 2024 20:40 13s
July 24, 2024 20:40 13s
[Bugfix] Bump transformers to 4.43.2
Add label on auto-merge enabled #83: Pull request #6752 auto_merge_enabled by mgoin
July 24, 2024 18:55 13s
July 24, 2024 18:55 13s
[Bugfix] Fix awq_marlin and gptq_marlin flags
Add label on auto-merge enabled #82: Pull request #6745 auto_merge_enabled by mgoin
July 24, 2024 18:48 15s
July 24, 2024 18:48 15s
[Frontend] split run_server into build_server and run_server
Add label on auto-merge enabled #81: Pull request #6740 auto_merge_enabled by simon-mo
July 24, 2024 15:54 18s
July 24, 2024 15:54 18s
Adding f-string to validation error which is missing
Add label on auto-merge enabled #80: Pull request #6748 auto_merge_enabled by comaniac
July 24, 2024 15:52 14s
July 24, 2024 15:52 14s
[Bugfix] Miscalculated latency lead to time_to_first_token_seconds inaccurate.
Add label on auto-merge enabled #79: Pull request #6686 auto_merge_enabled by comaniac
July 24, 2024 15:44 11s
July 24, 2024 15:44 11s
ProTip! You can narrow down the results and go further in time using created:<2024-07-24 or the other filters available.