Skip to content

Commit

Permalink
feat: add group size 3 to GQA decode dispatch (#558)
Browse files Browse the repository at this point in the history
Llama 3.2 3B comes with 24 qo heads and 8 kv heads.
  • Loading branch information
abcdabcd987 authored Oct 25, 2024
1 parent 9e10936 commit 6227562
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions include/flashinfer/utils.cuh
Original file line number Diff line number Diff line change
Expand Up @@ -126,6 +126,9 @@
} else if (group_size == 2) { \
constexpr size_t GROUP_SIZE = 2; \
__VA_ARGS__ \
} else if (group_size == 3) { \
constexpr size_t GROUP_SIZE = 3; \
__VA_ARGS__ \
} else if (group_size == 4) { \
constexpr size_t GROUP_SIZE = 4; \
__VA_ARGS__ \
Expand Down

0 comments on commit 6227562

Please sign in to comment.