Pull requests: ggml-org/llama.cpp
feat(mtmd): add Eagle2-VL multimodal support (mmproj + SigLIP pipeline)
examples
python
python script changes
#17224
opened Nov 12, 2025 by
YaelGitAccount
vulkan: remove shell call from vulkan-shaders-gen tool, revert file check
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17219
opened Nov 12, 2025 by
0cc4m
sycl: unify unary kernels with a generic implementation and enable wide operator support
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17213
opened Nov 12, 2025 by
shani-f
ggml-hexagon: add hex_supported_buffer for better buffer supported check
ggml
changes relating to the ggml tensor library for machine learning
[SYCL]refactor pad_reflect_1d to make the UT case pass
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17204
opened Nov 12, 2025 by
NeoZhangJianyu
ci: Add openEuler-cann in build/release
Ascend NPU
issues specific to Ascend NPUs
devops
improvements to build systems and github actions
#17192
opened Nov 12, 2025 by
xuedinge233
vulkan: skip all-negative-inf blocks in FA
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17186
opened Nov 12, 2025 by
jeffbolznv
ggml webgpu: add support for emscripten builds
build
Compilation issues
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
testing
Everything test related
#17184
opened Nov 12, 2025 by
reeselevine
vulkan: add LOG operation support for F32 and F16
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17183
opened Nov 12, 2025 by
zayac
opencl: add kernel to handle mat mul in attention to improve encoding speed
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#17181
opened Nov 11, 2025 by
shaofeiqi
cmake : fix ARM feature verification
ggml
changes relating to the ggml tensor library for machine learning
#17170
opened Nov 11, 2025 by
angt
vulkan: change graph_compute to be async and enable get_tensor_async
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17158
opened Nov 10, 2025 by
jeffbolznv
HIP: WMMA-MMQ kernels for RDNA 4
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17156
opened Nov 10, 2025 by
jiachengjason
llama.android : Rewrite Android binding
android
Issues specific to Android
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
#17152
opened Nov 10, 2025 by
hanyin-arm
vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17147
opened Nov 10, 2025 by
SavicStefan
metal : make the FA extra sizes consistent
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17143
opened Nov 10, 2025 by
ggerganov
Add complete Megrez-MoE support: GGUF conversion + inference.
model
Model specific
python
python script changes
#17141
opened Nov 10, 2025 by
tamarPal
llama: introduce support for model-embedded sampling parameters
python
python script changes
#17120
opened Nov 9, 2025 by
taronaeo
rpc : fix alloc size logic
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17116
opened Nov 9, 2025 by
ggerganov
2 tasks