Skip to content

Actions: vllm-project/vllm

Lint and Deploy Charts

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
10,061 workflow runs
10,061 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add cutlass support for blackwell fp8 blockwise gemm
Lint and Deploy Charts #9912: Pull request #14383 synchronize by wenscarl
March 10, 2025 20:00 7m 10s wenscarl:scale_mm_bw_sm100
March 10, 2025 20:00 7m 10s
[V1] V1 Enablement Oracle
Lint and Deploy Charts #9911: Pull request #13726 synchronize by robertgshaw2-redhat
March 10, 2025 19:44 7m 7s v1-default
March 10, 2025 19:44 7m 7s
[V1] V1 Enablement Oracle
Lint and Deploy Charts #9910: Pull request #13726 synchronize by robertgshaw2-redhat
March 10, 2025 19:29 7m 40s v1-default
March 10, 2025 19:29 7m 40s
[VLM] Merged multi-modal processor for Pixtral
Lint and Deploy Charts #9909: Pull request #12211 synchronize by DarkLight1337
March 10, 2025 19:07 7m 9s Flechman:pixtral-mm-processor
March 10, 2025 19:07 7m 9s
[VLM] Merged multi-modal processor for Pixtral
Lint and Deploy Charts #9908: Pull request #12211 synchronize by DarkLight1337
March 10, 2025 18:50 7m 0s Flechman:pixtral-mm-processor
March 10, 2025 18:50 7m 0s
[Perf]:Optimize qwen2-vl to reduce cudaMemcpyAsync
Lint and Deploy Charts #9906: Pull request #14377 synchronize by cynthieye
March 10, 2025 18:33 7m 50s cynthieye:main
March 10, 2025 18:33 7m 50s
[INTEL-HPU] Deepseek R1 model enabling for Intel Gaudi
Lint and Deploy Charts #9905: Pull request #14455 synchronize by xuechendi
March 10, 2025 18:02 7m 7s HabanaAI:deepseek_r1_upstream
March 10, 2025 18:02 7m 7s
[Model] Extend Ultravox to accept audio longer than 30s
Lint and Deploy Charts #9904: Pull request #13631 synchronize by farzadab
March 10, 2025 17:42 7m 5s fixie-ai:farzad-long-audio
March 10, 2025 17:42 7m 5s
[INTEL-HPU] Deepseek R1 model enabling for Intel Gaudi
Lint and Deploy Charts #9902: Pull request #14455 synchronize by xuechendi
March 10, 2025 17:37 7m 9s HabanaAI:deepseek_r1_upstream
March 10, 2025 17:37 7m 9s
[Kernel] moe wna16 cuda kernel
Lint and Deploy Charts #9901: Pull request #13321 synchronize by jinzhen-lin
March 10, 2025 17:29 7m 16s jinzhen-lin:moe_wna16_cuda_kernel
March 10, 2025 17:29 7m 16s
dynamic distpatch of fp8 kernels
Lint and Deploy Charts #9898: Pull request #14245 synchronize by jeffdaily
March 10, 2025 17:17 7m 3s ROCm:is_fp8_fnuz
March 10, 2025 17:17 7m 3s
[Model] Add Reasoning Parser for Granite Models
Lint and Deploy Charts #9897: Pull request #14202 synchronize by alex-jw-brooks
March 10, 2025 16:54 7m 12s alex-jw-brooks:granite_reasoning
March 10, 2025 16:54 7m 12s
[Minor] Update the tqdm bar for parallel sampling
Lint and Deploy Charts #9893: Pull request #14571 opened by WoosukKwon
March 10, 2025 16:31 7m 37s fix-parallel-sample
March 10, 2025 16:31 7m 37s
Mseznec/flash attention fp8
Lint and Deploy Charts #9891: Pull request #14570 synchronize by mickaelseznec
March 10, 2025 16:08 7m 9s mickaelseznec:mseznec/flash-attention-fp8
March 10, 2025 16:08 7m 9s
[Hardware][TPU][V1] Multi-LoRA implementation for the V1 TPU backend
Lint and Deploy Charts #9890: Pull request #14238 synchronize by Akshat-Tripathi
March 10, 2025 16:05 6m 57s krai:multi_lora_tpu_v1
March 10, 2025 16:05 6m 57s
[Hardware][TPU][V1] Multi-LoRA implementation for the V1 TPU backend
Lint and Deploy Charts #9889: Pull request #14238 synchronize by Akshat-Tripathi
March 10, 2025 15:57 7m 26s krai:multi_lora_tpu_v1
March 10, 2025 15:57 7m 26s
dynamic distpatch of fp8 kernels
Lint and Deploy Charts #9888: Pull request #14245 synchronize by jeffdaily
March 10, 2025 15:45 7m 12s ROCm:is_fp8_fnuz
March 10, 2025 15:45 7m 12s