Actions: ROCm/vllm

Cleanup PR Body

79 workflow runs

Updated fused_moe configs for MI325X with Triton 3.2
Cleanup PR Body #79: Pull request #345 opened by JArnoldAMD
December 21, 2024 02:21 24s
Update MI300X fused_moe configs for Triton 3.2
Cleanup PR Body #78: Pull request #344 opened by JArnoldAMD
December 20, 2024 20:59 23s
Library versions bump
Cleanup PR Body #77: Pull request #343 opened by gshtras
December 20, 2024 16:00 26s
[Fix] fix_vllm_moe_quant
Cleanup PR Body #76: Pull request #342 opened by belovedxixi
December 20, 2024 10:24 24s
[Fix] fix_vllm_moe_quant
Cleanup PR Body #75: Pull request #341 edited by belovedxixi
December 20, 2024 04:08 22s
[Fix] fix_vllm_moe_quant
Cleanup PR Body #74: Pull request #341 opened by belovedxixi
December 20, 2024 04:05 15s
Ingest FP8 attn scales and use them in ROCm FlashAttention
Cleanup PR Body #73: Pull request #338 edited by mawong-amd
December 19, 2024 23:23 23s
Ingest FP8 attn scales and use them in ROCm FlashAttention
Cleanup PR Body #72: Pull request #338 edited by mawong-amd
December 19, 2024 23:22 20s
Ingest fix
Cleanup PR Body #71: Pull request #340 opened by gshtras
December 19, 2024 22:09 21s
Ingest FP8 attn scales and use them in ROCm FlashAttention
Cleanup PR Body #70: Pull request #338 edited by mawong-amd
December 19, 2024 01:41 19s
Ingest FP8 attn scales and use them in ROCm FlashAttention
Cleanup PR Body #69: Pull request #338 opened by mawong-amd
December 19, 2024 01:40 15s
Properly initializing the new field in the attn metadata
Cleanup PR Body #68: Pull request #337 opened by gshtras
December 18, 2024 21:43 23s
Using the generic base image created by the vllm-ci pipeline
Cleanup PR Body #67: Pull request #336 opened by gshtras
December 18, 2024 16:56 22s
Mllama kv scale fix
Cleanup PR Body #66: Pull request #335 opened by gshtras
December 18, 2024 16:41 19s
[Minor] updating Docker manifest
Cleanup PR Body #65: Pull request #334 opened by arakowsk-amd
December 18, 2024 03:57 21s
Fixed the new condition for fp8 type
Cleanup PR Body #64: Pull request #333 opened by gshtras
December 17, 2024 23:42 24s
Fix regression from #246
Cleanup PR Body #63: Pull request #332 opened by gshtras
December 16, 2024 22:42 16s
[Minor] Updating Dev Docker docs
Cleanup PR Body #62: Pull request #331 opened by arakowsk-amd
December 16, 2024 21:52 23s
Upstream merge 24 12 16
Cleanup PR Body #61: Pull request #330 opened by gshtras
December 16, 2024 17:43 23s
Merging PR#327 into main branch
Cleanup PR Body #60: Pull request #328 edited by pramenku
December 13, 2024 13:16 25s
Merging PR#327 into main branch
Cleanup PR Body #59: Pull request #328 opened by pramenku
December 13, 2024 12:34 23s
Fix logging of the vLLM Config (#11143)
Cleanup PR Body #58: Pull request #325 opened by gshtras
December 12, 2024 20:18 20s
Disable auto enabling chunked prefill
Cleanup PR Body #57: Pull request #324 opened by gshtras
December 12, 2024 20:03 15s
Disable triton FA
Cleanup PR Body #56: Pull request #323 opened by hliuca
December 12, 2024 19:25 19s
Update README.md
Cleanup PR Body #55: Pull request #322 edited by gshtras
December 12, 2024 16:33 18s