Pull requests: ggml-org/llama.cpp

Pull requests list

ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others)
#15188 opened Aug 9, 2025 by Tak-RS (labels: ggml)
common : add GLM-4.5 tool calling support
#15186 opened Aug 8, 2025 by dhandhalyabhavik
ggml: add conv3d op
#15182 opened Aug 8, 2025 by rmatif (labels: ggml, testing)
gpt-oss: implement harmony parsing
#15181 opened Aug 8, 2025 by aldehir (labels: examples, server, testing)
CUDA: add attention sinks for tile and wmma
#15178 opened Aug 8, 2025 by am17an (labels: ggml, Nvidia GPU)
kv-cache : log (debug) all streams in find_slot
#15176 opened Aug 8, 2025 by danbev
server : enable -td and -tbd parameters
#15172 opened Aug 8, 2025 by CISC (labels: examples, server)
MoE Expert manipulation args
#15165 opened Aug 8, 2025 by kooshi
tool-call: Qwen3 Coder chat format support
#15162 opened Aug 8, 2025 by ochafik, Draft (labels: testing)
Fix prompt cache
#15160 opened Aug 7, 2025 by 708-145
sycl: Fix and disable more configurations of mul_mat
#15151 opened Aug 7, 2025 by Rbiessy (labels: ggml, SYCL)
kleidiai: fix unsigned overflow bug
#15150 opened Aug 7, 2025 by chaxu01 (labels: ggml)
chat : Avoid partial reasoning tags in response content
#15149 opened Aug 7, 2025 by p1-0tr (labels: testing)
SVE support for exponential functions
#15145 opened Aug 7, 2025 by s-goto-11 (labels: ggml)
OpenCL: allow mixed f16/f32 add
#15140 opened Aug 6, 2025 by rmatif (labels: ggml, OpenCL)
CUDA: Optimize reduce_rows_f32 kernel, leading up to 25x perf improvement on kernel-level and 10% perf increase for Gemma3n
#15132 opened Aug 6, 2025 by ORippler (labels: ggml, Nvidia GPU, testing)
Add T5Gemma support #14940
#15123 opened Aug 6, 2025 by baonudesifeizhai (labels: python)
ggml: aarch64: Implement SVE F16 kernels for vector functions
#15115 opened Aug 6, 2025 by Vithulep (labels: ggml)
CUDA/HIP: ssm-scan: switch from shared memory to registers, fixes indexing problem on warp64 devices
#15101 opened Aug 5, 2025 by IMbackK (labels: ggml, Nvidia GPU)
model : add reasoning/tool parsing to Llama 3.x Nemotron
#15083 opened Aug 5, 2025 by aldehir, Draft (labels: testing)