[Kernel][Attention] Separate Attention.kv_scale
into k_scale
and v_scale
#6081
Merged
simon-mo merged 13 commits intovllm-project:mainfrom neuralmagic:separate-key-value-scalesJul 16, 2024
+317-185
Commits
Commits on Jul 3, 2024
- committed
- committed
- committed