[Kernel][Attention] Separate Attention.kv_scale into k_scale and v_scale #8632
Job | Run time |
---|---|
 | 28s |
 | 21s |
 | 24s |
 | 27s |
 | 1m 40s |