Skip to content

Actions: neuralmagic/vllm

PR Reminder Comment Bot

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
55 workflow runs
55 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[WIP, Kernel] (2/N) Machete - Integrate into GPTQMarlinLinearMethod and CompressedTensorsWNA16
PR Reminder Comment Bot #5: Pull request #5 opened by LucasWilkinson
August 7, 2024 23:10 13s
August 7, 2024 23:10 13s
Implement GPTQMarlinMoEMethod to support quantized MOE models
PR Reminder Comment Bot #4: Pull request #4 opened by DhruvaBansal00
August 7, 2024 00:22 9s
August 7, 2024 00:22 9s
DO NOT MERGE : Layer-by-Layer Profiling
PR Reminder Comment Bot #3: Pull request #3 opened by varun-sundar-rabindranath
August 5, 2024 15:19 14s
August 5, 2024 15:19 14s
Marlin MoE integration
PR Reminder Comment Bot #2: Pull request #2 opened by ElizaWszola
August 2, 2024 01:47 13s
August 2, 2024 01:47 13s
Add 2:4 sparsity as a quantization method
PR Reminder Comment Bot #1: Pull request #1 opened by mgoin
August 1, 2024 21:44 14s
August 1, 2024 21:44 14s