Skip to content

Actions: MengqingCao/vllm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
297 workflow runs
297 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[CI/Build] Fix pre-commit errors (#13696)
pre-commit #21: Commit 7f6bae5 pushed by MengqingCao
February 22, 2025 10:28 5m 45s main
February 22, 2025 10:28 5m 45s
Close inactive issues and PRs
Close inactive issues and PRs #111: Scheduled
February 22, 2025 02:28 14s main
February 22, 2025 02:28 14s
[Bugfix][CPU] Fix cpu all-reduce using native pytorch implementation …
pre-commit #20: Commit b2c3fc5 pushed by MengqingCao
February 21, 2025 07:26 3m 50s main
February 21, 2025 07:26 3m 50s
[Neuron][Kernel] Vectorize KV cache load in FlashPagedAttention to ma…
pre-commit #19: Commit 3317008 pushed by MengqingCao
February 21, 2025 02:40 5m 37s main
February 21, 2025 02:40 5m 37s
Close inactive issues and PRs
Close inactive issues and PRs #110: Scheduled
February 21, 2025 02:32 13s main
February 21, 2025 02:32 13s
Close inactive issues and PRs
Close inactive issues and PRs #109: Scheduled
February 20, 2025 02:32 11s main
February 20, 2025 02:32 11s
Close inactive issues and PRs
Close inactive issues and PRs #108: Scheduled
February 19, 2025 02:32 12s main
February 19, 2025 02:32 12s
[Bugfix][CI][V1] Work around V1 + CUDA Graph + torch._scaled_mm fallb…
pre-commit #18: Commit b3942e1 pushed by MengqingCao
February 18, 2025 02:32 4m 35s main
February 18, 2025 02:32 4m 35s
Close inactive issues and PRs
Close inactive issues and PRs #107: Scheduled
February 18, 2025 02:30 11s main
February 18, 2025 02:30 11s
[Feature][Spec Decode] Simplify the use of Eagle Spec Decode (#12304)
pre-commit #17: Commit 46cdd59 pushed by MengqingCao
February 17, 2025 06:04 4m 41s main
February 17, 2025 06:04 4m 41s
Close inactive issues and PRs
Close inactive issues and PRs #106: Scheduled
February 17, 2025 02:35 10s main
February 17, 2025 02:35 10s
[V1][BugFix] Clean up rejection sampler & Fix warning msg (#13362)
pre-commit #16: Commit 69e1d23 pushed by MengqingCao
February 17, 2025 01:28 6m 8s main
February 17, 2025 01:28 6m 8s
Close inactive issues and PRs
Close inactive issues and PRs #105: Scheduled
February 16, 2025 02:36 10s main
February 16, 2025 02:36 10s
Close inactive issues and PRs
Close inactive issues and PRs #104: Scheduled
February 15, 2025 02:29 11s main
February 15, 2025 02:29 11s
Close inactive issues and PRs
Close inactive issues and PRs #103: Scheduled
February 14, 2025 02:31 10s main
February 14, 2025 02:31 10s
Close inactive issues and PRs
Close inactive issues and PRs #102: Scheduled
February 13, 2025 02:31 12s main
February 13, 2025 02:31 12s
[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAtt…
pre-commit #15: Commit e92694b pushed by MengqingCao
February 12, 2025 06:16 5m 51s main
February 12, 2025 06:16 5m 51s
Close inactive issues and PRs
Close inactive issues and PRs #101: Scheduled
February 12, 2025 02:31 12s main
February 12, 2025 02:31 12s
[V1][Metrics] Add GPU prefix cache hit rate % gauge (#12592)
pre-commit #14: Commit 41c5dd4 pushed by MengqingCao
February 11, 2025 08:47 5m 38s main
February 11, 2025 08:47 5m 38s
Close inactive issues and PRs
Close inactive issues and PRs #100: Scheduled
February 11, 2025 02:32 15s main
February 11, 2025 02:32 15s
[Doc] Add link to tool_choice tracking issue in tool_calling.md (#13003)
pre-commit #13: Commit 2431371 pushed by MengqingCao
February 10, 2025 09:06 5m 44s main
February 10, 2025 09:06 5m 44s
Close inactive issues and PRs
Close inactive issues and PRs #99: Scheduled
February 10, 2025 02:32 14s main
February 10, 2025 02:32 14s
Close inactive issues and PRs
Close inactive issues and PRs #98: Scheduled
February 9, 2025 02:33 13s main
February 9, 2025 02:33 13s
Close inactive issues and PRs
Close inactive issues and PRs #97: Scheduled
February 8, 2025 02:26 16s main
February 8, 2025 02:26 16s
Close inactive issues and PRs
Close inactive issues and PRs #96: Scheduled
February 7, 2025 02:32 11s main
February 7, 2025 02:32 11s