Skip to content

Actions: vllm-project/vllm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
122,359 workflow run results
122,359 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[V1] Further reduce CPU overheads in flash-attn (#10989)
mypy #23930: Commit 3b61cb4 pushed by WoosukKwon
December 9, 2024 20:38 48s main
December 9, 2024 20:38 48s
[Model] Add support for embedding model GritLM
Lint documentation #324: Pull request #10816 synchronize by pooyadavoodi
December 9, 2024 20:26 20s pooyadavoodi:add_gritlm_vllm
December 9, 2024 20:26 20s
[Model] Add support for embedding model GritLM
Lint documentation #323: Pull request #10816 synchronize by pooyadavoodi
December 9, 2024 20:24 22s pooyadavoodi:add_gritlm_vllm
December 9, 2024 20:24 22s
[CI] Add test case with JSON schema using references + use xgrammar by default with OpenAI parse
Cleanup PR Body #1233: Pull request #10935 edited by mgoin
December 9, 2024 19:51 12s
December 9, 2024 19:51 12s
[Core] Support offloading KV cache to CPU
codespell #3130: Pull request #10874 synchronize by ApostaC
December 9, 2024 19:50 20s KuntaiDu:yihua-cpu-offloading2
December 9, 2024 19:50 20s
[Core] Support offloading KV cache to CPU
clang-format #17742: Pull request #10874 synchronize by ApostaC
December 9, 2024 19:50 14s KuntaiDu:yihua-cpu-offloading2
December 9, 2024 19:50 14s
[Core] Support offloading KV cache to CPU
yapf #30000: Pull request #10874 synchronize by ApostaC
December 9, 2024 19:50 1m 50s KuntaiDu:yihua-cpu-offloading2
December 9, 2024 19:50 1m 50s