-
Notifications
You must be signed in to change notification settings - Fork 13.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ci : use registry cache for docker builds
devops
improvements to build systems and github actions
#16366
opened Oct 1, 2025 by
CISC
Loading…
vulkan: Fix FA coopmat1 invalid array indexing
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16365
opened Oct 1, 2025 by
jeffbolznv
Loading…
Add support to New feature or request
examples
server/webui
server
◁think▷...◁/think▷
format and DRY the thinking processing logic
enhancement
#16364
opened Sep 30, 2025 by
allozaur
Loading…
Add ARANGE Operator to SYCL Backend (Small & Focused Changes)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16362
opened Sep 30, 2025 by
GittyBurstein
Loading…
feat: render user content as markdown option
examples
server
#16358
opened Sep 30, 2025 by
ServeurpersoCom
Loading…
ggml webgpu: add support for soft_max, optimize rms_norm
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16357
opened Sep 30, 2025 by
reeselevine
Loading…
Fix mobile unwanted scrolling
examples
server
#16356
opened Sep 30, 2025 by
ServeurpersoCom
Loading…
vulkan: Replace uses of maxMemoryAllocationSize and VK_WHOLE_SIZE
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16354
opened Sep 30, 2025 by
jeffbolznv
Loading…
SYCL SET operator optimized for F32 tensors
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16350
opened Sep 30, 2025 by
GittyBurstein
Loading…
Update build.md
documentation
Improvements or additions to documentation
#16346
opened Sep 30, 2025 by
refine360-debug
Loading…
vulkan : incremental shader builds
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16341
opened Sep 29, 2025 by
Acly
Loading…
Add optional setting for showing "Model used:" information
examples
server
#16337
opened Sep 29, 2025 by
allozaur
Loading…
ggml-cpu : inspect -march and -mcpu to found the CPU
ggml
changes relating to the ggml tensor library for machine learning
#16333
opened Sep 29, 2025 by
angt
Loading…
vulkan: in flash attention, bounds check against nem1 (don't rely on GGML_KQ_MASK_PAD)
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16316
opened Sep 28, 2025 by
jeffbolznv
Loading…
ggml : fix unaligned access in AMX code
ggml
changes relating to the ggml tensor library for machine learning
#16315
opened Sep 28, 2025 by
ggerganov
Loading…
ggml : remove SVE paths
ggml
changes relating to the ggml tensor library for machine learning
#16314
opened Sep 28, 2025 by
ggerganov
Loading…
ggml-backend : unify the dl_load_library() return type
ggml
changes relating to the ggml tensor library for machine learning
#16313
opened Sep 28, 2025 by
haiyuewa
Loading…
Enable Intel AMX acceleration while in CPU/GPU hybrid with new "--amx" toggle.
examples
#16310
opened Sep 28, 2025 by
Gadflyii
Loading…
cuda : Disable host buffers on integrated GPUs (#15034)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16308
opened Sep 28, 2025 by
ai-fonsi
Loading…
ci: Properly install rocwmma for hip builds
devops
improvements to build systems and github actions
#16305
opened Sep 28, 2025 by
IMbackK
Loading…
Add a deepwiki badge to auto-refresh the wiki-in-deepwiki weekly.
#16296
opened Sep 28, 2025 by
0400H
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-01.