CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

Annotations

1 warning

Push Docker image to Docker Hub (server-rocm, .devops/llama-server-rocm.Dockerfile, linux/amd64,l...

succeeded Aug 24, 2024 in 38m 59s