Skip to content

Actions: Nexesenex/croco.cpp

Server

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
249 workflow runs
249 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Compilade/cuda tq2 0
Server #254: Pull request #329 opened by Nexesenex
January 17, 2025 03:35 5m 46s ggerganov:compilade/cuda-tq2_0
January 17, 2025 03:35 5m 46s
Cuda back
Server #253: Pull request #328 opened by Nexesenex
January 15, 2025 00:13 5m 25s JohannesGaessler:cuda-back
January 15, 2025 00:13 5m 25s
Compilade/cuda tq2 0
Server #251: Pull request #326 opened by Nexesenex
January 10, 2025 22:18 6m 39s ggerganov:compilade/cuda-tq2_0
January 10, 2025 22:18 6m 39s
llama: Ensure KV cache is fully defragmented.
Server #250: Pull request #325 opened by Nexesenex
December 25, 2024 07:14 5m 44s jessegross:kv_defrag
December 25, 2024 07:14 5m 44s
Faster ssm scan
Server #249: Pull request #324 opened by Nexesenex
December 19, 2024 21:41 4m 10s A3shTnT:faster_ssm_scan
December 19, 2024 21:41 4m 10s
Sl/cuda opt argmax
Server #248: Pull request #323 opened by Nexesenex
November 21, 2024 10:50 8m 2s ggerganov:sl/cuda-opt-argmax
November 21, 2024 10:50 8m 2s
CUDA: remove DMMV, consolidate F16 mult mat vec
Server #247: Pull request #322 opened by Nexesenex
November 16, 2024 06:09 7m 2s JohannesGaessler:cuda-mmv-5
November 16, 2024 06:09 7m 2s
Sl/aligned alloc no abort
Server #246: Pull request #321 opened by Nexesenex
November 4, 2024 12:04 7m 54s ggerganov:sl/aligned-alloc-no-abort
November 4, 2024 12:04 7m 54s
K shift2
Server #245: Pull request #320 opened by Nexesenex
October 26, 2024 20:22 6m 52s MaggotHATE:k-shift2
October 26, 2024 20:22 6m 52s
Grammar memo
Server #243: Pull request #318 opened by Nexesenex
October 23, 2024 08:36 27m 13s clarismiranda:grammar-memo
October 23, 2024 08:36 27m 13s
Extend sgemm.cpp support for Q5_0
Server #242: Pull request #317 opened by Nexesenex
October 23, 2024 08:32 12m 47s Srihari-mcw:add_q5_support_sgemm
October 23, 2024 08:32 12m 47s
Xsn/llama batch remove compat
Server #241: Pull request #316 opened by Nexesenex
October 15, 2024 19:38 7m 59s ngxson:xsn/llama_batch_remove_compat
October 15, 2024 19:38 7m 59s
cuda : fix defrag with quantized KV (#9319)
Server #239: Pull request #311 opened by Nexesenex
September 5, 2024 18:32 9m 53s ggerganov:master
September 5, 2024 18:32 9m 53s
b3639
Server #238: Pull request #310 opened by Nexesenex
August 28, 2024 09:11 9m 29s ggerganov:master
August 28, 2024 09:11 9m 29s
b3631
Server #237: Pull request #309 opened by Nexesenex
August 27, 2024 02:30 9m 37s ggerganov:master
August 27, 2024 02:30 9m 37s
b3617
Server #236: Pull request #308 opened by Nexesenex
August 23, 2024 09:31 16m 23s ggerganov:master
August 23, 2024 09:31 16m 23s
Fix llama minitron
Server #235: Pull request #307 opened by Nexesenex
August 23, 2024 09:27 9m 17s nyxkrage:fix-llama-minitron
August 23, 2024 09:27 9m 17s
b3615
Server #234: Pull request #306 opened by Nexesenex
August 22, 2024 07:57 9m 36s ggerganov:master
August 22, 2024 07:57 9m 36s
b3602
Server #233: Pull request #305 opened by Nexesenex
August 18, 2024 15:28 19m 50s ggerganov:master
August 18, 2024 15:28 19m 50s
b3599
Server #232: Pull request #304 opened by Nexesenex
August 16, 2024 20:07 10m 24s ggerganov:master
August 16, 2024 20:07 10m 24s
b3596
Server #231: Pull request #303 opened by Nexesenex
August 16, 2024 10:06 20m 10s ggerganov:master
August 16, 2024 10:06 20m 10s
Const ref pair
Server #230: Pull request #302 opened by Nexesenex
August 15, 2024 22:31 9m 12s GermanAizek:const-ref-pair
August 15, 2024 22:31 9m 12s