CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

Annotations

1 warning

Push Docker image to Docker Hub (light-cuda, .devops/llama-cli-cuda.Dockerfile, linux/amd64)

succeeded Aug 24, 2024 in 2h 11m 3s