Skip to content

CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

CPU/CUDA: Gemma 2 FlashAttention support (#8542)

CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

Triggered via push August 24, 2024 21:38
Status Success
Total duration 1h 12m 49s
Artifacts 17

build.yml

on: push
Matrix: windows-latest-cmake-cuda
Matrix: windows-latest-cmake
macOS-latest-cmake-arm64
2m 23s
macOS-latest-cmake-arm64
macOS-latest-cmake-x64
8m 56s
macOS-latest-cmake-x64
ubuntu-focal-make
3m 48s
ubuntu-focal-make
ubuntu-latest-cmake
2m 44s
ubuntu-latest-cmake
macOS-latest-make
2m 10s
macOS-latest-make
macOS-latest-cmake
1m 44s
macOS-latest-cmake
ubuntu-focal-make-curl
3m 0s
ubuntu-focal-make-curl
ubuntu-latest-cmake-rpc
2m 20s
ubuntu-latest-cmake-rpc
ubuntu-22-cmake-vulkan
3m 2s
ubuntu-22-cmake-vulkan
ubuntu-22-cmake-hip
18m 9s
ubuntu-22-cmake-hip
ubuntu-22-cmake-sycl
10m 1s
ubuntu-22-cmake-sycl
ubuntu-22-cmake-sycl-fp16
10m 20s
ubuntu-22-cmake-sycl-fp16
macOS-latest-cmake-ios
2m 17s
macOS-latest-cmake-ios
macOS-latest-cmake-tvos
2m 5s
macOS-latest-cmake-tvos
windows-latest-cmake-sycl
11m 26s
windows-latest-cmake-sycl
windows-latest-cmake-hip
14m 14s
windows-latest-cmake-hip
ios-xcode-build
1m 29s
ios-xcode-build
android-build
15m 38s
android-build
Matrix: macOS-latest-swift
Matrix: ubuntu-latest-cmake-sanitizer
Matrix: windows-msys2
Fit to window
Zoom out
Zoom in

Annotations

1 error and 9 warnings
ubuntu-22-cmake-sycl-fp16
The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
ubuntu-22-cmake-sycl-fp16
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
ubuntu-22-cmake-sycl
The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
ubuntu-22-cmake-sycl
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
windows-latest-cmake-hip
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
android-build
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-java@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
ubuntu-22-cmake-hip
The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
release
The following actions uses node12 which is deprecated and will be forced to run on node16: actions/github-script@v3. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
release
The following actions use a deprecated Node.js version and will be forced to run on node20: anzz1/action-create-release@v1, actions/github-script@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

Artifacts

Produced during runtime
Name Size
cudart-llama-bin-win-cu11.7.1-x64.zip
293 MB
cudart-llama-bin-win-cu12.2.0-x64.zip
412 MB
llama-bin-macos-arm64.zip
48.4 MB
llama-bin-macos-x64.zip
50 MB
llama-bin-ubuntu-x64.zip
54 MB
llama-bin-win-avx-x64.zip
7.65 MB
llama-bin-win-avx2-x64.zip
7.65 MB
llama-bin-win-avx512-x64.zip
7.65 MB
llama-bin-win-cu11.7.1-x64.zip
143 MB
llama-bin-win-cu12.2.0-x64.zip
142 MB
llama-bin-win-kompute-x64.zip
7.92 MB
llama-bin-win-llvm-arm64.zip
11.1 MB
llama-bin-win-msvc-arm64.zip
13.2 MB
llama-bin-win-noavx-x64.zip
7.64 MB
llama-bin-win-openblas-x64.zip
18.6 MB
llama-bin-win-sycl-x64.zip
68.6 MB
llama-bin-win-vulkan-x64.zip
8.24 MB