Brandon/cast unquantized flux to bfloat16 #6815

brandonrising · 2024-09-05T17:09:20Z

Summary

This PR forces unquantized flux model state dicts to be converted to bfloat16 since that's all we support for inference currently. This can be removed or minimized as we support more data types in the future. This will allow us to support the seemingly common float8 flux fine tunes available on sites like huggingface and civitAI.

QA Instructions

Attempt to perform inference with a model similar to this one

Merge Plan

Can be merged when tested and approved

Checklist

The PR has a short but descriptive title, suitable for a changelog

RyanJDick

I left a handful of comments - mostly around all of the explicit memory management we are doing with (del and gc.collect()).

I tested it as-is and it works for me. The quality seems pretty clearly worse than a 4-bit quantized model though 😅

invokeai/backend/model_manager/load/model_loaders/flux.py

invokeai/backend/model_manager/util/model_util.py

…mory during upcasting

brandonrising requested review from lstein, blessedcoolant, RyanJDick and hipsterusername as code owners September 5, 2024 17:09

github-actions bot added python PRs that change python files backend PRs that change backend files labels Sep 5, 2024

hipsterusername approved these changes Sep 5, 2024

View reviewed changes

RyanJDick reviewed Sep 5, 2024

View reviewed changes

RyanJDick approved these changes Sep 5, 2024

View reviewed changes

brandonrising added 5 commits September 5, 2024 15:38

Cast tensors in unquantized flux models to bfloat16 during loading

991d264

Update flux transformer loader to more efficiently use and release me…

8150a58

…mory during upcasting

Add comment explaining the cache make room call

289ee12

Remove dependency of asizeof

816aac8

Simplify flux model dtype conversion in model loader

f08e942

brandonrising force-pushed the brandon/cast-unquantized-flux-to-bfloat16 branch from 117e3e0 to f08e942 Compare September 5, 2024 19:38

brandonrising enabled auto-merge (rebase) September 5, 2024 19:39

brandonrising merged commit a16b555 into main Sep 5, 2024
14 checks passed

brandonrising deleted the brandon/cast-unquantized-flux-to-bfloat16 branch September 5, 2024 19:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Brandon/cast unquantized flux to bfloat16 #6815

Brandon/cast unquantized flux to bfloat16 #6815

brandonrising commented Sep 5, 2024 •

edited

Loading

RyanJDick left a comment

Brandon/cast unquantized flux to bfloat16 #6815

Brandon/cast unquantized flux to bfloat16 #6815

Conversation

brandonrising commented Sep 5, 2024 • edited Loading

Summary

QA Instructions

Merge Plan

Checklist

RyanJDick left a comment

Choose a reason for hiding this comment

brandonrising commented Sep 5, 2024 •

edited

Loading