Pull requests: ggml-org/llama.cpp
[fix] Fix 32-bit narrowing issue in export-lora and mtmd clip
#14503
opened Jul 2, 2025 by
kiwi142857
model : add support for apple/DiffuCoder-7B-cpGRPO
#14502
opened Jul 2, 2025 by
gabriellarson
ggml : remove kompute backend
build
Compilation issues
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
script
Script related
testing
Everything test related
#14501
opened Jul 2, 2025 by
ggerganov
MUSA: upgrade musa sdk to <<TBD>>
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14498
opened Jul 2, 2025 by
yeahdongcn
Draft
CUDA: add dynamic shared mem to softmax, refactor general usage
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14497
opened Jul 2, 2025 by
am17an
vulkan: unpack more values at a time for iquants mat mul
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14485
opened Jul 1, 2025 by
netrunnereve
Compute buffer and KV-cache aware layer distribution for multi-GPU inference
#14484
opened Jul 1, 2025 by
borebot
ggml: backward pass for split swiglu
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#14483
opened Jul 1, 2025 by
JohannesGaessler
opencl : add GELU_ERF
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14476
opened Jul 1, 2025 by
CISC
server : (webui) let server send locally-defined default webui settings
examples
server
#14468
opened Jun 30, 2025 by
woof-dog
Chore: batch prompts, extract tensors specific layer
examples
#14463
opened Jun 30, 2025 by
VakantieModus
convert : correct gemma 3n conversion
python
python script changes
#14450
opened Jun 29, 2025 by
ngxson
Pr/7191
build
Compilation issues
devops
improvements to build systems and github actions
python
python script changes
#14447
opened Jun 29, 2025 by
esrakorkmz
ggml : implement GEGLU_ERF and GEGLU_QUICK ops
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#14445
opened Jun 29, 2025 by
CISC
Added CI with RISC-V RVV1.0 Hardware
devops
improvements to build systems and github actions
#14439
opened Jun 29, 2025 by
alitariq4589
model : add hunyuan moe
python
python script changes
#14425
opened Jun 27, 2025 by
ngxson
4 tasks done
ggml : add ggml_scale_bias
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
[CANN] weight format to nz for Ascend310P3
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14407
opened Jun 27, 2025 by
tqgy6
OpenCL: add conv2d kernel
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14403
opened Jun 26, 2025 by
rmatif
ggml : add pointer to attach user data
ggml
changes relating to the ggml tensor library for machine learning
#14397
opened Jun 26, 2025 by
koush
compare-commits.sh: support both llama-bench and test-backend-ops
python
python script changes
script
Script related
#14392
opened Jun 26, 2025 by
yeahdongcn
ggml-cpu: Build variant targeting Neoverse-V2
ggml
changes relating to the ggml tensor library for machine learning
#14380
opened Jun 25, 2025 by
ckastner
webui: preserve partial content when streaming errors occur
examples
server
#14374
opened Jun 25, 2025 by
Aaryan-549
5 of 8 tasks