Enable CK Attention for Navi31 #285

hyoon1 · 2024-11-18T23:27:22Z

Enables CK Attention for Navi31
Requires this branch of Flash Attention:
https://github.com/ROCm/flash-attention/tree/howiejay/navi_support

- Enables CK Attention for Navi31 - Requires this branch of Flash Attention: - https://github.com/ROCm/flash-attention/tree/howiejay/navi_support

gshtras

Is this similar to #281
Can the 2 efforts be combined?

Please also try to simplify the conditions, with this level of nested elseifs it's hard to follow the logic.

gshtras · 2024-12-16T23:42:24Z

vllm/attention/backends/rocm_flash_attn.py

-                    from flash_attn import flash_attn_varlen_func  # noqa: F401
-                    self.attn_func = flash_attn_varlen_func
+                if flash_attn_available:
+                    if current_platform.has_device_capability(110):


This check isn't equivalent to is_navi
On Cuda device_capability is meant to increase with each new architecture, and is meant to differentiate by new features support, such as FP8, etc.
On ROCm it is 1st digit of gfx * 10 + 2nd digit, which doesn't mean much, especially for any future architectures.

hyoon1 · 2024-12-17T00:55:43Z

Is this similar to #281 Can the 2 efforts be combined?

Please also try to simplify the conditions, with this level of nested elseifs it's hard to follow the logic.

PR #281 is specific to a particular vision model and does not call the path used in general LLM models. Therefore, it seems difficult to merge. The condition has been simplified to match the depth in the existing code.

hyoon1 requested review from maleksan85 and gshtras November 18, 2024 23:27

Enable CK Attention for Navi31

a0c1e1c

- Enables CK Attention for Navi31 - Requires this branch of Flash Attention: - https://github.com/ROCm/flash-attention/tree/howiejay/navi_support

hyoon1 force-pushed the navi_ck branch from d90b48f to a0c1e1c Compare November 19, 2024 07:23

gshtras and others added 4 commits November 19, 2024 11:30

Merge branch 'develop' into navi_ck

ba71585

Merge branch 'develop' into navi_ck

fd20600

Merge branch 'develop' into navi_ck

f087706

Merge branch 'develop' into navi_ck

64ce953

gshtras requested changes Dec 16, 2024

View reviewed changes

Simplify conditional branches and use is_navi3 for capability

714dbef

hyoon1 requested a review from gshtras December 19, 2024 17:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable CK Attention for Navi31 #285

Enable CK Attention for Navi31 #285

hyoon1 commented Nov 18, 2024 •

edited by github-actions bot

Loading

gshtras left a comment

gshtras Dec 16, 2024

hyoon1 Dec 17, 2024

hyoon1 commented Dec 17, 2024

Enable CK Attention for Navi31 #285

Are you sure you want to change the base?

Enable CK Attention for Navi31 #285

Conversation

hyoon1 commented Nov 18, 2024 • edited by github-actions bot Loading

gshtras left a comment

Choose a reason for hiding this comment

gshtras Dec 16, 2024

Choose a reason for hiding this comment

hyoon1 Dec 17, 2024

Choose a reason for hiding this comment

hyoon1 commented Dec 17, 2024

hyoon1 commented Nov 18, 2024 •

edited by github-actions bot

Loading