
Does FlexAttention Support torch.vmap? #25

Open
MiladInk opened this issue Aug 17, 2024 · 3 comments

MiladInk commented Aug 17, 2024
I know that torch.nn.functional.scaled_dot_product_attention does not support torch.vmap, and that is a big problem for me. Does FlexAttention support batched mode?
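For context, the pattern in question looks roughly like the sketch below: lifting `scaled_dot_product_attention` over an extra leading dimension with `torch.vmap`. The shapes here are illustrative, not from the thread; at the time this issue was opened, this call was not supported.

```python
import torch
import torch.nn.functional as F

# One extra leading dimension beyond the usual (batch, heads, seq, head_dim).
q = torch.randn(4, 2, 8, 128, 64)
k = torch.randn(4, 2, 8, 128, 64)
v = torch.randn(4, 2, 8, 128, 64)

# vmap maps attention over dim 0 of each input; the inner call sees
# ordinary (batch, heads, seq, head_dim) tensors.
out = torch.vmap(F.scaled_dot_product_attention)(q, k, v)
print(out.shape)  # torch.Size([4, 2, 8, 128, 64])
```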

drisspg (Contributor) commented Aug 17, 2024

Cc @zou3519

Chillee (Contributor) commented Aug 17, 2024

@MiladInk We haven't added support yet, but it shouldn't be too hard. I've also been meaning to add vmap support for scaled_dot_product_attention... might do it this weekend.

Chillee (Contributor) commented Aug 22, 2024

btw, we just added vmap support for scaled_dot_product_attention: pytorch/pytorch#133964

We'll also look into doing it for FlexAttention soon.
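To illustrate what the linked change enables (a hedged sketch, assuming a build that includes pytorch/pytorch#133964; shapes are illustrative): `torch.vmap` can now batch `scaled_dot_product_attention`, including over only some of the inputs via `in_dims`.

```python
import torch
import torch.nn.functional as F

q = torch.randn(4, 8, 16, 64)  # four query sets, each (heads, q_len, head_dim)
k = torch.randn(8, 128, 64)    # shared keys:   (heads, kv_len, head_dim)
v = torch.randn(8, 128, 64)    # shared values: (heads, kv_len, head_dim)

# Map over the query dimension only; keys and values are shared by every slice.
batched = torch.vmap(F.scaled_dot_product_attention, in_dims=(0, None, None))
out = batched(q, k, v)
print(out.shape)  # torch.Size([4, 8, 16, 64])
```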
