
Remove conditional import of flash_attn #43

Open
b8raoult opened this issue Sep 16, 2024 · 0 comments

b8raoult commented Sep 16, 2024

What happened?

Inference crashes without any meaningful error message (it just exits back to the shell) when training was done with flash_attn installed but inference is run without it, and vice versa.

What are the steps to reproduce the bug?

Two ways to reproduce the issue:

  1. Train a model with flash_attn installed, then run inference in an environment where flash_attn is not installed.
  2. Train the model without flash_attn, then run inference in an environment where flash_attn is installed.

The same mismatch can also occur when a training run is restarted from a checkpoint in a different environment.

Version

all

Platform (OS and architecture)

any

Relevant log output

# Conditional import in question: if flash_attn is not installed, the code
# silently falls back to PyTorch's scaled_dot_product_attention.
try:
    from flash_attn import flash_attn_func as attn_func
except ImportError:
    from torch.nn.functional import scaled_dot_product_attention as attn_func

    _FLASH_ATTENTION_AVAILABLE = False
else:
    _FLASH_ATTENTION_AVAILABLE = True
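
One possible direction for removing the conditional import (a minimal sketch, not the project's actual API; the use_flash_attention flag and get_attention_function helper below are hypothetical) is to make the attention backend an explicit, recorded setting and raise a clear error when the requested backend is not available, instead of silently switching implementations:

def get_attention_function(use_flash_attention: bool):
    """Return the attention kernel explicitly requested by the configuration.

    `use_flash_attention` is a hypothetical flag stored with the model
    configuration/checkpoint, so the choice travels with the model instead of
    being inferred from whatever happens to be importable.
    """
    if use_flash_attention:
        try:
            from flash_attn import flash_attn_func
        except ImportError as exc:
            # Fail loudly with an actionable message instead of silently
            # falling back to a different attention implementation.
            raise RuntimeError(
                "The model was configured to use flash_attn, but flash_attn "
                "is not installed in the current environment."
            ) from exc
        return flash_attn_func

    from torch.nn.functional import scaled_dot_product_attention

    return scaled_dot_product_attention

With the choice recorded in the configuration, a checkpoint trained with one backend either loads with the same backend or fails with an explicit error, rather than exiting without a message.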

Accompanying data

No response

Organisation

No response
