server: allow filtering llama server response fields (#10940) #17934
build.yml
on: push
Matrix: windows-2019-cmake-cuda
Matrix: windows-latest-cmake-hip-release
Matrix: windows-latest-cmake
macOS-latest-cmake-arm64
12m 15s
macOS-latest-cmake-x64
4m 35s
ubuntu-latest-cmake
2m 50s
macOS-latest-cmake
12m 37s
ubuntu-latest-cmake-rpc
2m 25s
ubuntu-22-cmake-vulkan
15m 22s
ubuntu-22-cmake-hip
19m 11s
ubuntu-22-cmake-musa
11m 52s
ubuntu-22-cmake-sycl
4m 58s
ubuntu-22-cmake-sycl-fp16
4m 39s
macOS-latest-cmake-ios
1m 10s
macOS-latest-cmake-tvos
1m 7s
ubuntu-latest-cmake-cuda
11m 32s
windows-latest-cmake-sycl
12m 30s
windows-latest-cmake-hip
16m 37s
ios-xcode-build
1m 14s
android-build
6m 13s
Matrix: macOS-latest-swift
Matrix: ubuntu-latest-cmake-sanitizer
Matrix: windows-msys2
release
1m 13s
Annotations
1 error and 11 warnings
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
cudart-llama-bin-win-cu11.7-x64.zip
|
303 MB |
|
cudart-llama-bin-win-cu12.4-x64.zip
|
372 MB |
|
llama-bin-macos-arm64.zip
|
59.1 MB |
|
llama-bin-macos-x64.zip
|
60 MB |
|
llama-bin-ubuntu-x64.zip
|
66.8 MB |
|
llama-bin-win-avx-x64.zip
|
9.74 MB |
|
llama-bin-win-avx2-x64.zip
|
9.75 MB |
|
llama-bin-win-avx512-x64.zip
|
9.76 MB |
|
llama-bin-win-cu11.7-x64.zip
|
146 MB |
|
llama-bin-win-cu12.4-x64.zip
|
146 MB |
|
llama-bin-win-hip-x64-gfx1030.zip
|
230 MB |
|
llama-bin-win-hip-x64-gfx1100.zip
|
232 MB |
|
llama-bin-win-hip-x64-gfx1101.zip
|
232 MB |
|
llama-bin-win-kompute-x64.zip
|
10 MB |
|
llama-bin-win-llvm-arm64-opencl-adreno.zip
|
11.5 MB |
|
llama-bin-win-llvm-arm64.zip
|
11.4 MB |
|
llama-bin-win-msvc-arm64.zip
|
14.2 MB |
|
llama-bin-win-noavx-x64.zip
|
9.72 MB |
|
llama-bin-win-openblas-x64.zip
|
20.7 MB |
|
llama-bin-win-sycl-x64.zip
|
90.5 MB |
|
llama-bin-win-vulkan-x64.zip
|
11.7 MB |
|