Actions: neuralmagic/vllm

pre-commit

25 workflow runs

[V1][BugFix] Free encoder cache for aborted requests (#12545)
pre-commit #25: Commit e0cc5f2 pushed by SageMoore
January 29, 2025 23:26 5m 27s main
[WIP] Working Grouped gemm with group ID
pre-commit #24: Pull request #48 synchronize by ElizaWszola
January 29, 2025 13:55 4m 43s grouped-gemm-with-group-id
[WIP] Working Grouped gemm with group ID
pre-commit #23: Pull request #48 synchronize by ElizaWszola
January 29, 2025 07:10 4m 28s grouped-gemm-with-group-id
[WIP] Working Grouped gemm with group ID
pre-commit #22: Pull request #48 synchronize by ElizaWszola
January 28, 2025 04:46 4m 31s grouped-gemm-with-group-id
[Bugfix] Fix gpt2 GGUF inference (#12467)
pre-commit #21: Commit ce69f7f pushed by tlrmchlsmth
January 27, 2025 15:14 4m 36s main
January 27, 2025 09:38 4m 31s
[Bugfix/CI] Fix broken kernels/test_mha.py (#12450)
pre-commit #19: Commit 72f4880 pushed by tlrmchlsmth
January 26, 2025 18:59 4m 26s main
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417)
pre-commit #18: Commit aa2cd2c pushed by tlrmchlsmth
January 26, 2025 17:09 4m 21s main
Bump jinja2 from 3.1.4 to 3.1.5
pre-commit #17: Pull request #49 synchronize by dependabot bot
January 25, 2025 20:52 4m 31s dependabot/pip/jinja2-3.1.5
[TPU][CI] Update torchxla version in requirement-tpu.txt (#12422)
pre-commit #16: Commit 324960a pushed by tlrmchlsmth
January 25, 2025 20:48 4m 18s main
[ROCm][MoE] MI300 tuned configs Mixtral-8x(7B,22B) | fp16, fp8 (#12408)
pre-commit #15: Commit bf21481 pushed by tlrmchlsmth
January 25, 2025 04:40 4m 32s main
[WIP] Working Grouped gemm with group ID
pre-commit #14: Pull request #48 synchronize by ElizaWszola
January 24, 2025 22:41 4m 30s grouped-gemm-with-group-id
[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build o…
pre-commit #13: Commit 3132a93 pushed by tlrmchlsmth
January 24, 2025 21:08 4m 22s main
[Bugfix][Kernel] Fix CUDA 11.8 being broken by FA3 build (#12375)
pre-commit #12: Commit ab5bbf5 pushed by tlrmchlsmth
January 24, 2025 16:11 4m 23s main
[Misc] Enable proxy support in benchmark script (#12356)
pre-commit #11: Commit 3bb8e2c pushed by tlrmchlsmth
January 24, 2025 15:03 4m 29s main
[Docs] Document Phi-4 support (#12362)
pre-commit #10: Commit 2cbeeda pushed by SageMoore
January 23, 2025 20:17 4m 21s main
[WIP] Working Grouped gemm with group ID
pre-commit #9: Pull request #48 synchronize by ElizaWszola
January 23, 2025 18:28 4m 37s grouped-gemm-with-group-id
[WIP] Working Grouped gemm with group ID
pre-commit #8: Pull request #48 synchronize by ElizaWszola
January 23, 2025 15:48 4m 39s grouped-gemm-with-group-id
Add: Support for Sparse24Bitmask Compressed Models
pre-commit #7: Pull request #47 synchronize by rahul-tuli
January 22, 2025 21:39 4m 22s rahul-bitmask-additions
Add: Support for Sparse24Bitmask Compressed Models
pre-commit #6: Pull request #47 synchronize by rahul-tuli
January 22, 2025 21:35 4m 29s rahul-bitmask-additions
Add: Support for Sparse24Bitmask Compressed Models
pre-commit #5: Pull request #47 synchronize by rahul-tuli
January 22, 2025 21:15 4m 29s rahul-bitmask-additions
[Core] Support reset_prefix_cache (#12284)
pre-commit #4: Commit 7206ce4 pushed by tlrmchlsmth
January 22, 2025 20:37 4m 25s main
Add: Support for Sparse24Bitmask Compressed Models
pre-commit #3: Pull request #47 synchronize by rahul-tuli
January 22, 2025 19:29 4m 30s rahul-bitmask-additions
Add: Support for Sparse24Bitmask Compressed Models
pre-commit #2: Pull request #47 synchronize by rahul-tuli
January 22, 2025 18:23 5m 12s rahul-bitmask-additions
[Bugfix] Fix HPU multiprocessing executor (#12167)
pre-commit #1: Commit 96f6a75 pushed by rahul-tuli
January 22, 2025 18:20 5m 12s main