tensor core GQA dispatch for [4,5,6,8]
(#1258)
#505
| Job | Run time |
|---|---|
| | 14m 40s |
| | 13m 55s |
| Total | 28m 35s |