server : allow using LoRA adapters per-request (#10994) #658
build.yml
on: push
Matrix: windows-2019-cmake-cuda
Matrix: windows-latest-cmake-hip-release
Matrix: windows-latest-cmake

Job | Duration
---|---
macOS-latest-cmake-arm64 | 12m 13s
macOS-latest-cmake-x64 | 5m 42s
ubuntu-latest-cmake | 2m 46s
macOS-latest-cmake | 11m 20s
ubuntu-latest-cmake-rpc | 2m 29s
ubuntu-22-cmake-vulkan | 12m 26s
ubuntu-22-cmake-hip | 19m 18s
ubuntu-22-cmake-musa | 12m 6s
ubuntu-22-cmake-sycl | 4m 47s
ubuntu-22-cmake-sycl-fp16 | 4m 45s
macOS-latest-cmake-ios | 1m 8s
macOS-latest-cmake-tvos | 1m 10s
ubuntu-latest-cmake-cuda | 11m 52s
windows-latest-cmake-sycl | 10m 38s
windows-latest-cmake-hip | 28m 56s
ios-xcode-build | 1m 40s
android-build | 6m 49s

Matrix: macOS-latest-swift
Matrix: ubuntu-latest-cmake-sanitizer
Matrix: windows-msys2
release
1m 14s
Annotations
1 error and 11 warnings
Artifacts
Produced during runtime
Name | Size
---|---
cudart-llama-bin-win-cu11.7-x64.zip | 303 MB
cudart-llama-bin-win-cu12.4-x64.zip | 372 MB
llama-bin-macos-arm64.zip | 59.9 MB
llama-bin-macos-x64.zip | 60.7 MB
llama-bin-ubuntu-x64.zip | 65.8 MB
llama-bin-win-avx-x64.zip | 9.75 MB
llama-bin-win-avx2-x64.zip | 9.76 MB
llama-bin-win-avx512-x64.zip | 9.77 MB
llama-bin-win-cu11.7-x64.zip | 146 MB
llama-bin-win-cu12.4-x64.zip | 146 MB
llama-bin-win-hip-x64-gfx1030.zip | 230 MB
llama-bin-win-hip-x64-gfx1100.zip | 232 MB
llama-bin-win-hip-x64-gfx1101.zip | 232 MB
llama-bin-win-kompute-x64.zip | 10 MB
llama-bin-win-llvm-arm64-opencl-adreno.zip | 11.5 MB
llama-bin-win-llvm-arm64.zip | 11.5 MB
llama-bin-win-msvc-arm64.zip | 14.2 MB
llama-bin-win-noavx-x64.zip | 9.73 MB
llama-bin-win-openblas-x64.zip | 20.7 MB
llama-bin-win-sycl-x64.zip | 90.5 MB
llama-bin-win-vulkan-x64.zip | 11.8 MB