CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

Triggered via push August 24, 2024 21:38

jploski

master

Status Success

Total duration 1h 12m 49s

Artifacts 17

build.yml

on: push

Matrix: windows-latest-cmake-cuda

Matrix: windows-latest-cmake

macOS-latest-cmake-arm64

macOS-latest-cmake-x64

ubuntu-focal-make

ubuntu-latest-cmake

macOS-latest-make

macOS-latest-cmake

ubuntu-focal-make-curl

ubuntu-latest-cmake-rpc

ubuntu-22-cmake-vulkan

ubuntu-22-cmake-hip

ubuntu-22-cmake-sycl

ubuntu-22-cmake-sycl-fp16

macOS-latest-cmake-ios

macOS-latest-cmake-tvos

windows-latest-cmake-sycl

windows-latest-cmake-hip

ios-xcode-build

Matrix: macOS-latest-swift

Matrix: ubuntu-latest-cmake-sanitizer

Matrix: windows-msys2

Annotations

1 error and 9 warnings

windows-latest-cmake (avx512-x64, -DGGML_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DGGML_RPC=ON -DGGML_...

Process completed with exit code 1.

ubuntu-22-cmake-sycl-fp16

The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/

ubuntu-22-cmake-sycl-fp16

The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

ubuntu-22-cmake-sycl

The following actions uses node12 which is deprecated and will be forced to run on node16: actions/checkout@v2. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/

ubuntu-22-cmake-sycl

The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v2. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

windows-latest-cmake-hip

The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

android-build

The following actions use a deprecated Node.js version and will be forced to run on node20: actions/setup-java@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

ubuntu-22-cmake-hip

The following actions use a deprecated Node.js version and will be forced to run on node20: actions/checkout@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

The following actions uses node12 which is deprecated and will be forced to run on node16: actions/github-script@v3. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/

The following actions use a deprecated Node.js version and will be forced to run on node20: anzz1/action-create-release@v1, actions/github-script@v3. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/

Artifacts

Produced during runtime

Name	Size
cudart-llama-bin-win-cu11.7.1-x64.zip	293 MB
cudart-llama-bin-win-cu12.2.0-x64.zip	412 MB
llama-bin-macos-arm64.zip	48.4 MB
llama-bin-macos-x64.zip	50 MB
llama-bin-ubuntu-x64.zip	54 MB
llama-bin-win-avx-x64.zip	7.65 MB
llama-bin-win-avx2-x64.zip	7.65 MB
llama-bin-win-avx512-x64.zip	7.65 MB
llama-bin-win-cu11.7.1-x64.zip	143 MB
llama-bin-win-cu12.2.0-x64.zip	142 MB
llama-bin-win-kompute-x64.zip	7.92 MB
llama-bin-win-llvm-arm64.zip	11.1 MB
llama-bin-win-msvc-arm64.zip	13.2 MB
llama-bin-win-noavx-x64.zip	7.64 MB
llama-bin-win-openblas-x64.zip	18.6 MB
llama-bin-win-sycl-x64.zip	68.6 MB
llama-bin-win-vulkan-x64.zip	8.24 MB