Skip to content

Actions: NVIDIA/TensorRT-LLM

auto-assign

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
75 workflow runs
75 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Mpool Failure on H100 DGX node
auto-assign #77: Issue #2649 labeled by christian-ci
January 2, 2025 19:38 3s
January 2, 2025 19:38 3s
gemma 2 convert_checkpoint takes gpu ram more than needed
auto-assign #76: Issue #2647 labeled by Alireza3242
January 2, 2025 14:17 2s
January 2, 2025 14:17 2s
Failed to build engine with lookahead_decoding
auto-assign #75: Issue #2641 labeled by aikitoria
December 31, 2024 21:29 2s
December 31, 2024 21:29 2s
December 30, 2024 06:15 2s
Cpp runner outputs wrong results when using lora + tensor parallelism
auto-assign #73: Issue #2634 labeled by ShuaiShao93
December 28, 2024 00:04 2s
December 28, 2024 00:04 2s
Troubleshoot mistral model
auto-assign #72: Issue #2632 labeled by krishnanpooja
December 26, 2024 12:35 3s
December 26, 2024 12:35 3s
Qwen2.5-72B-Instruct YaRN BUG
auto-assign #71: Issue #2630 labeled by PaulX1029
December 26, 2024 06:10 2s
December 26, 2024 06:10 2s
Adding custom sampling config
auto-assign #70: Issue #2609 labeled by nv-guomingz
December 24, 2024 15:28 3s
December 24, 2024 15:28 3s
[Performance] What is the purpose of compiling a model?
auto-assign #69: Issue #2617 labeled by nv-guomingz
December 24, 2024 15:26 2s
December 24, 2024 15:26 2s
December 24, 2024 15:25 39s
SIGABRT while trying to build trtllm engine for biomistral model on T4
auto-assign #67: Issue #2619 labeled by nv-guomingz
December 24, 2024 15:24 2s
December 24, 2024 15:24 2s
Performance of streaming requests is worse than non-streaming
auto-assign #66: Issue #2613 labeled by nv-guomingz
December 24, 2024 15:21 49s
December 24, 2024 15:21 49s
Phi4 support?
auto-assign #65: Issue #2616 labeled by nv-guomingz
December 24, 2024 15:07 3s
December 24, 2024 15:07 3s
support for T4
auto-assign #64: Issue #2620 labeled by nv-guomingz
December 24, 2024 15:04 2s
December 24, 2024 15:04 2s
support for T4
auto-assign #63: Issue #2620 labeled by krishnanpooja
December 24, 2024 11:32 2s
December 24, 2024 11:32 2s
SIGABRT while trying to build trtllm engine for biomistral model on T4
auto-assign #62: Issue #2619 labeled by krishnanpooja
December 24, 2024 10:44 3s
December 24, 2024 10:44 3s
[Performance] What is the purpose of compiling a model?
auto-assign #61: Issue #2617 labeled by Flynn-Zh
December 24, 2024 10:03 2s
December 24, 2024 10:03 2s
Performance of streaming requests is worse than non-streaming
auto-assign #60: Issue #2613 labeled by activezhao
December 24, 2024 07:50 2s
December 24, 2024 07:50 2s
Adding custom sampling config
auto-assign #59: Issue #2609 labeled by buddhapuneeth
December 23, 2024 23:58 3s
December 23, 2024 23:58 3s
SmoothQuant doesn't work with lora
auto-assign #58: Issue #2604 labeled by nv-guomingz
December 23, 2024 06:11 39s
December 23, 2024 06:11 39s
[Feature Request] Better support for w4a8 quantization
auto-assign #57: Issue #2605 labeled by nv-guomingz
December 23, 2024 06:09 47s
December 23, 2024 06:09 47s
Gemma 2 LoRA support
auto-assign #56: Issue #2606 labeled by nv-guomingz
December 23, 2024 06:09 43s
December 23, 2024 06:09 43s
SmoothQuant doesn't work with lora
auto-assign #55: Issue #2604 labeled by ShuaiShao93
December 20, 2024 20:39 2s
December 20, 2024 20:39 2s
lora doesn't work with --use_fp8_rowwise
auto-assign #54: Issue #2603 labeled by ShuaiShao93
December 20, 2024 20:15 2s
December 20, 2024 20:15 2s