Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat: add group size 3 to GQA decode dispatch (#558)
Llama 3.2 3B comes with 24 qo heads and 8 kv heads.
- Loading branch information