Skip to content

Actions: MengqingCao/vllm

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
280 workflow runs
280 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Improve pipeline partitioning (#13839)
pre-commit #25: Commit 145944c pushed by MengqingCao
February 26, 2025 06:25 5m 0s main
February 26, 2025 06:25 5m 0s
Close inactive issues and PRs
Close inactive issues and PRs #115: Scheduled
February 26, 2025 02:35 12s main
February 26, 2025 02:35 12s
[Bugfix] Initialize attention bias on the same device as Query/Key/Va…
pre-commit #24: Commit 75e9d49 pushed by MengqingCao
February 25, 2025 10:53 4m 50s main
February 25, 2025 10:53 4m 50s
[Bugfix] Support MLA for CompressedTensorsWNA16 (#13725)
pre-commit #23: Commit 18e5059 pushed by MengqingCao
February 25, 2025 07:35 5m 0s main
February 25, 2025 07:35 5m 0s
Close inactive issues and PRs
Close inactive issues and PRs #114: Scheduled
February 25, 2025 02:35 10s main
February 25, 2025 02:35 10s
Close inactive issues and PRs
Close inactive issues and PRs #113: Scheduled
February 24, 2025 02:36 14s main
February 24, 2025 02:36 14s
[V1][BugFix] Fix engine core client shutdown hangs (#13298)
pre-commit #22: Commit cbae7af pushed by MengqingCao
February 24, 2025 01:25 4m 47s main
February 24, 2025 01:25 4m 47s
Close inactive issues and PRs
Close inactive issues and PRs #112: Scheduled
February 23, 2025 02:36 13s main
February 23, 2025 02:36 13s
[CI/Build] Fix pre-commit errors (#13696)
pre-commit #21: Commit 7f6bae5 pushed by MengqingCao
February 22, 2025 10:28 5m 45s main
February 22, 2025 10:28 5m 45s
Close inactive issues and PRs
Close inactive issues and PRs #111: Scheduled
February 22, 2025 02:28 14s main
February 22, 2025 02:28 14s
[Bugfix][CPU] Fix cpu all-reduce using native pytorch implementation …
pre-commit #20: Commit b2c3fc5 pushed by MengqingCao
February 21, 2025 07:26 3m 50s main
February 21, 2025 07:26 3m 50s
[Neuron][Kernel] Vectorize KV cache load in FlashPagedAttention to ma…
pre-commit #19: Commit 3317008 pushed by MengqingCao
February 21, 2025 02:40 5m 37s main
February 21, 2025 02:40 5m 37s
Close inactive issues and PRs
Close inactive issues and PRs #110: Scheduled
February 21, 2025 02:32 13s main
February 21, 2025 02:32 13s
Close inactive issues and PRs
Close inactive issues and PRs #109: Scheduled
February 20, 2025 02:32 11s main
February 20, 2025 02:32 11s
Close inactive issues and PRs
Close inactive issues and PRs #108: Scheduled
February 19, 2025 02:32 12s main
February 19, 2025 02:32 12s
[Bugfix][CI][V1] Work around V1 + CUDA Graph + torch._scaled_mm fallb…
pre-commit #18: Commit b3942e1 pushed by MengqingCao
February 18, 2025 02:32 4m 35s main
February 18, 2025 02:32 4m 35s
Close inactive issues and PRs
Close inactive issues and PRs #107: Scheduled
February 18, 2025 02:30 11s main
February 18, 2025 02:30 11s
[Feature][Spec Decode] Simplify the use of Eagle Spec Decode (#12304)
pre-commit #17: Commit 46cdd59 pushed by MengqingCao
February 17, 2025 06:04 4m 41s main
February 17, 2025 06:04 4m 41s
Close inactive issues and PRs
Close inactive issues and PRs #106: Scheduled
February 17, 2025 02:35 10s main
February 17, 2025 02:35 10s
[V1][BugFix] Clean up rejection sampler & Fix warning msg (#13362)
pre-commit #16: Commit 69e1d23 pushed by MengqingCao
February 17, 2025 01:28 6m 8s main
February 17, 2025 01:28 6m 8s
Close inactive issues and PRs
Close inactive issues and PRs #105: Scheduled
February 16, 2025 02:36 10s main
February 16, 2025 02:36 10s
Close inactive issues and PRs
Close inactive issues and PRs #104: Scheduled
February 15, 2025 02:29 11s main
February 15, 2025 02:29 11s
Close inactive issues and PRs
Close inactive issues and PRs #103: Scheduled
February 14, 2025 02:31 10s main
February 14, 2025 02:31 10s
Close inactive issues and PRs
Close inactive issues and PRs #102: Scheduled
February 13, 2025 02:31 12s main
February 13, 2025 02:31 12s
[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAtt…
pre-commit #15: Commit e92694b pushed by MengqingCao
February 12, 2025 06:16 5m 51s main
February 12, 2025 06:16 5m 51s