Pull requests: ggml-org/llama.cpp
ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others)
ggml
changes relating to the ggml tensor library for machine learning
#15188
opened Aug 9, 2025 by
Tak-RS
ggml: add conv3d op
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#15182
opened Aug 8, 2025 by
rmatif
CUDA: add attention sinks for tile and wmma
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15178
opened Aug 8, 2025 by
am17an
server : implement /api/version endpoint for ollama compatibility (#15167)
examples
server
#15177
opened Aug 8, 2025 by
albert-polak
GPT-OSS: parse commentary tool calls; handle glued 'json'; add unit tests (#15102)
testing
Everything test related
#15158
opened Aug 7, 2025 by
Nerexis
sycl: Fix and disable more configurations of mul_mat
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#15151
opened Aug 7, 2025 by
Rbiessy
kleidiai: fix unsigned overflow bug
ggml
changes relating to the ggml tensor library for machine learning
#15150
opened Aug 7, 2025 by
chaxu01
chat : Avoid partial reasoning tags in response content
testing
Everything test related
#15149
opened Aug 7, 2025 by
p1-0tr
SVE support for exponential functions
ggml
changes relating to the ggml tensor library for machine learning
#15145
opened Aug 7, 2025 by
s-goto-11
OpenCL: allow mixed f16/f32 add
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15140
opened Aug 6, 2025 by
rmatif
CUDA: Optimize reduce_rows_f32 kernel, leading up to 25x perf improvement on kernel-level and 10% perf increase for Gemma3n
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#15132
opened Aug 6, 2025 by
ORippler
Add T5Gemma support #14940
python
python script changes
#15123
opened Aug 6, 2025 by
baonudesifeizhai
ggml: aarch64: Implement SVE F16 kernels for vector functions
ggml
changes relating to the ggml tensor library for machine learning
#15115
opened Aug 6, 2025 by
Vithulep
CUDA/HIP: ssm-scan: switch from shared memory to registers, fixes indexing problem on warp64 devices
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15101
opened Aug 5, 2025 by
IMbackK