Pull requests: ggml-org/llama.cpp

Pull requests list

clip : bring back GPU support
Labels: examples
#12322 opened Mar 10, 2025 by ngxson
Enabling building llama.cpp using system libggml
Labels: build, examples, ggml
#12321 opened Mar 10, 2025 by ckastner
vulkan: query register count and use it in a better split_k heuristic
Labels: ggml, Vulkan
#12319 opened Mar 10, 2025 by jeffbolznv
vulkan: fix bug in coopmat1 mul_mat_id
Labels: ggml, testing, Vulkan
#12316 opened Mar 10, 2025 by jeffbolznv
CUDA/HIP: Fix fattn-vec-* when device warp size is not 32
Labels: ggml, Nvidia GPU
#12315 opened Mar 10, 2025 by IMbackK
vulkan: Add N/2 and N/4 optimized paths in coopmat2 shader
Labels: ggml, Vulkan
#12312 opened Mar 10, 2025 by jeffbolznv
ggml : fix quantized cpy op
Labels: ggml, testing
#12310 opened Mar 10, 2025 by ggerganov
vulkan: use fp32 in coopmat2 q4_k dequant function
Labels: ggml, Vulkan
#12309 opened Mar 10, 2025 by jeffbolznv
Update build.yml for Windows Vulkan builder to use Vulkan 1.4.304 SDK…
Labels: devops
#12301 opened Mar 10, 2025 by oscarbg
tool-call: Phi-4 support
Labels: testing
#12288 opened Mar 9, 2025 by jpohhhh
vulkan: Pad N dimension of B matrix for coopmat2 perf, to avoid bounds checking
Labels: ggml, Vulkan
#12273 opened Mar 8, 2025 by jeffbolznv
vulkan: fix coopmat shader generation when cross-compiling
Labels: ggml, Vulkan
#12272 opened Mar 8, 2025 by Icenowy
metal: Cache compiled library at device level
Labels: Apple Metal, ggml
#12265 opened Mar 8, 2025 by BB-fat
Add simple-tts example
Labels: examples
#12261 opened Mar 8, 2025 by danemadsen
vulkan: optimization proposals for coopmat1 mul_mm
Labels: ggml, Vulkan
#12260 opened Mar 7, 2025 by remyoudompheng Draft
vulkan: Adjust coopmat2 tile sizes and selection heuristic
Labels: ggml, Vulkan
#12258 opened Mar 7, 2025 by jeffbolznv
main : add -sysf / --system-prompt-file (#12249)
#12250 opened Mar 7, 2025 by CISC
server : Add verbose output to OAI compatible chat endpoint.
Labels: android, Apple Metal, build, devops, documentation, examples, ggml, Kompute, nix, Nvidia GPU, python, script, server, SYCL, testing, Vulkan
#12246 opened Mar 7, 2025 by mglambda
Fix rocWMMA build documentation
Labels: documentation
#12243 opened Mar 7, 2025 by Headcrabed