-
Notifications
You must be signed in to change notification settings - Fork 11k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
bugfix: Respect n_predict=-2 in server (#12264)
examples
server
#12323
opened Mar 10, 2025 by
ishaangandhi
Loading…
vulkan: query register count and use it in a better split_k heuristic
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12319
opened Mar 10, 2025 by
jeffbolznv
Loading…
vulkan: fix bug in coopmat1 mul_mat_id
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12316
opened Mar 10, 2025 by
jeffbolznv
Loading…
CUDA/HIP: Fix fattn-vec-* when device warp size is not 32
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12315
opened Mar 10, 2025 by
IMbackK
Loading…
vulkan: Add N/2 and N/4 optimized paths in coopmat2 shader
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12312
opened Mar 10, 2025 by
jeffbolznv
Loading…
ggml : fix quantized cpy op
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#12310
opened Mar 10, 2025 by
ggerganov
Loading…
vulkan: use fp32 in coopmat2 q4_k dequant function
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12309
opened Mar 10, 2025 by
jeffbolznv
Loading…
Update build.yml for Windows Vulkan builder to use Vulkan 1.4.304 SDK…
devops
improvements to build systems and github actions
#12301
opened Mar 10, 2025 by
oscarbg
Loading…
tool-call
: Phi-4 support
testing
#12288
opened Mar 9, 2025 by
jpohhhh
Loading…
server: fix "--grammar-file" parameter
examples
server
#12285
opened Mar 9, 2025 by
dodekapod
Loading…
vulkan: Pad N dimension of B matrix for coopmat2 perf, to avoid bounds checking
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12273
opened Mar 8, 2025 by
jeffbolznv
Loading…
vulkan: fix coopmat shader generation when cross-compiling
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12272
opened Mar 8, 2025 by
Icenowy
Loading…
metal: Cache compiled library at device level
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12265
opened Mar 8, 2025 by
BB-fat
Loading…
doc: add text-based diagram of software architecture in toplevel README.md
#12263
opened Mar 8, 2025 by
zhouwg
Loading…
vulkan: optimization proposals for coopmat1 mul_mm
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12260
opened Mar 7, 2025 by
remyoudompheng
•
Draft
vulkan: Adjust coopmat2 tile sizes and selection heuristic
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12258
opened Mar 7, 2025 by
jeffbolznv
Loading…
server : Add verbose output to OAI compatible chat endpoint.
android
Issues specific to Android
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
script
Script related
server
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12246
opened Mar 7, 2025 by
mglambda
Loading…
Fix rocWMMA build documentation
documentation
Improvements or additions to documentation
#12243
opened Mar 7, 2025 by
Headcrabed
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.