
Redo custom attention processor to support other attention types #6550

Open · wants to merge 1 commit into base: main

Conversation

Contributor

@StAlKeR7779 StAlKeR7779 commented Jun 27, 2024

Summary

The current custom attention processor implements only the torch-sdp attention type, so whenever an IP-Adapter or a regional prompt is used, we override the model to run torch-sdp attention.
The new attention processor combines the four attention processors (normal, sliced, xformers, torch-sdp) by moving the parts of attention that differ (mask preparation and the attention computation itself) into a separate function call, where the implementation required for the configured type is executed.
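The dispatch idea described above can be sketched in pure Python. All names here are hypothetical and the math is a reference scaled-dot-product implementation, not the actual InvokeAI code: a real processor would call `torch.nn.functional.scaled_dot_product_attention` or the xformers kernels inside the type-specific branch, while the shared mask preparation and projection logic stays in the common path.

```python
import math


def _softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def _sdp_attention(q, k, v):
    # Reference scaled dot-product attention on lists of row vectors.
    # Stands in for the "normal" path (and for torch-sdp/xformers,
    # which would call their library kernels in real code).
    d = len(q[0])
    out = []
    for qi in q:
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k]
        weights = _softmax(scores)
        out.append(
            [sum(w * vj[c] for w, vj in zip(weights, v)) for c in range(len(v[0]))]
        )
    return out


def _sliced_attention(q, k, v, slice_size=1):
    # Process query rows in slices to bound peak memory; each row of the
    # output is independent, so slicing does not change the result.
    out = []
    for start in range(0, len(q), slice_size):
        out.extend(_sdp_attention(q[start:start + slice_size], k, v))
    return out


def run_attention(q, k, v, attention_type="torch-sdp"):
    # Hypothetical dispatch point: the shared processor code calls this
    # once mask preparation is done, and only this part varies by type.
    if attention_type in ("normal", "torch-sdp", "xformers"):
        return _sdp_attention(q, k, v)
    if attention_type == "sliced":
        return _sliced_attention(q, k, v)
    raise ValueError(f"unknown attention_type: {attention_type}")
```

Because only this one call varies, features like IP-Adapter and regional prompting can live in the shared processor code and still honor the user's configured attention type.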

Related Issues / Discussions

None

QA Instructions

Change `attention_type` in `invokeai.yaml`, then run a generation with an IP-Adapter or a regional prompt.
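For reference, a config fragment along these lines selects the attention type (the exact set of accepted values depends on the InvokeAI version; this is an illustrative example, not a verbatim copy of any shipped config):

```yaml
# invokeai.yaml (fragment)
attention_type: sliced   # e.g. normal, xformers, sliced, torch-sdp
```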

Merge Plan

None?

Checklist

  • The PR has a short but descriptive title, suitable for a changelog
  • Tests added / updated (if applicable)
  • Documentation added / updated (if applicable)

@dunkeroni @RyanJDick

@github-actions github-actions bot added python PRs that change python files backend PRs that change backend files labels Jun 27, 2024
@StAlKeR7779 StAlKeR7779 marked this pull request as ready for review June 27, 2024 14:02
@RyanJDick
Collaborator

I haven't looked at the code yet, but do you know if there are still use cases for attention processors other than Torch 2.0 SDP? Based on the benchmarking that diffusers has done, it seems like the all-around best choice. But maybe there are still reasons to use other implementations, e.g. very-low-VRAM systems?

@StAlKeR7779
Contributor Author

I thought roughly the same:
  • normal - generally no need for it
  • xformers - if torch-sdp is on par or even faster, then it too can be removed
  • sliced - yes, it's suitable for low-memory situations, and I think it's the main attention type for MPS

@psychedelicious
Collaborator

On CUDA, torch's SDP was faster than xformers for me when I last checked, a month or so back. IIRC it was just a couple of percent faster.
