
feat(nodes): migrate cnet nodes away from controlnet_aux #6831

Merged
hipsterusername merged 30 commits into main from psyche/feat/migrate-cnet-nodes on Sep 11, 2024

Conversation

psychedelicious
Collaborator

Summary

This gets us closer to removing our dependency on the controlnet_aux package. That package provides a handful of classes that perform "controlnet preprocessing"; most of these classes run an ML model.

Why

Those classes have baked in image resizing logic, apparently intended for use in the A1111 controlnet extension:

  • detect_resolution (int arg): resizes the image to fit within the given dimension before running it through the processor
  • image_resolution (int arg): resizes the image to fit within the given dimension after running it through the processor
  • Some models require input image dimensions to be multiples of 8, but the resizing logic snaps images to the nearest multiple of 64
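To make the round-trip problem concrete, here is a simplified, hypothetical sketch of this style of resizing (the real controlnet_aux/A1111 helper differs in its details; `fit_to_resolution` is an illustrative name, not an actual function in either codebase):

```python
def fit_to_resolution(h: int, w: int, resolution: int) -> tuple[int, int]:
    """Scale (h, w) so the short side is roughly `resolution`, then snap
    both sides to the nearest multiple of 64, as A1111-style preprocessors
    do. A simplified sketch, not the exact controlnet_aux implementation."""
    k = resolution / min(h, w)
    return round(h * k / 64) * 64, round(w * k / 64) * 64

# An odd-sized image does not survive the round trip:
h1, w1 = fit_to_resolution(1000, 600, 512)  # detect_resolution step
h2, w2 = fit_to_resolution(h1, w1, 600)     # image_resolution step
print((h1, w1))  # (832, 512)
print((h2, w2))  # (960, 576) -- not the original (1000, 600)
```

Even with carefully chosen resolution args, the snapping to multiples of 64 means the original dimensions generally cannot be recovered.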

There are issues with this API:

  • You have to carefully select the resolution args to get the right size of image back
  • It seems that occasionally, even when you provide the right dimension to the resolution args, the output is just slightly off
  • The models that require image dimensions to be multiples of 8 would sometimes not resize the image at the end
  • The extra resizing takes some amount of time
  • The extra resizing causes minor degradation in image quality
  • The models are loaded directly from HF, bypassing the model manager & its cache

Up until this point, these issues have not caused problems, because our controlnet implementation automatically resizes control images just before generation. This meant that the dimensions of the processed control images didn't really matter.

With Canvas v2, control image processing is implemented as layer filters. After processing, the image is put back onto the canvas, and should be the exact same size as it was before processing. It is not acceptable for the images to be differently sized.

How

Instead of making potentially breaking and hairy changes to the existing "controlnet processor" nodes, I've created a new set of nodes to be on filter duty in Canvas v2:

  • Canny Edge Detection
  • Color Map
  • Content Shuffle
  • Depth Anything
  • HED Edge Detection
  • Lineart Anime Edge Detection
  • Lineart Edge Detection
  • MediaPipe Face Detection
  • MLSD Edge Detection
  • Normal Map Generation
  • PiDiNet Edge Detection

I have not migrated:

  • The OG SAM node (I don't think it has any use-case currently)
  • Leres, Midas and Zoe depth nodes (Depth Anything is superior, there's no good reason to use any of these over it)

Nodes that run models now have revised, separate classes that give our model manager control over model downloading, caching and loading.
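As a rough illustration of that pattern (all class and method names here are hypothetical, not Invoke's actual API): the invocation obtains the weights through the model manager, and the detector class only wraps an already-loaded model rather than downloading from HF itself.

```python
from dataclasses import dataclass

@dataclass
class LoadedModel:
    """Stand-in for whatever the model manager hands back (a loaded,
    device-placed, cached model). Purely illustrative."""
    model: object

class EdgeDetector:
    """Wraps an already-loaded model; does no downloading or caching of
    its own. The real revised classes live under backend/image_util/."""
    def __init__(self, model: object) -> None:
        self.model = model

    @classmethod
    def from_loaded_model(cls, loaded: LoadedModel) -> "EdgeDetector":
        return cls(loaded.model)

# Inside an invocation, the flow would be roughly (pseudocode, since the
# exact model-manager API is not spelled out in this PR description):
#   loaded = model_manager.load(model_key)  # downloads/caches as needed
#   detector = EdgeDetector.from_loaded_model(loaded)
#   result = detector.run(image)
```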

This change does bring in a lot of code from controlnet_aux. The biggest chunk is the backend support for NormalBAE. Turns out there's a whole git repo embedded in controlnet_aux for the EfficientNet architecture... I understand the timm package can be used instead but I didn't pursue this.

As mentioned, the new nodes skip all image resizing except where necessary for the model to run, in which case the images are resized back to the original dimensions before the node finishes. This makes it a lot easier to use these nodes too - just provide the image and settings. No need to futz around with image_resolution and detect_resolution.
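That resizing contract can be sketched as follows (a hypothetical stand-in operating on (height, width) tuples rather than real images; `run_filter` and `snap_down_to_multiple` are illustrative names and `run_model` is a stub):

```python
def snap_down_to_multiple(x: int, m: int = 8) -> int:
    """Round x down to a multiple of m, with a floor of m."""
    return max((x // m) * m, m)

def run_filter(size: tuple[int, int], run_model) -> tuple[int, int]:
    """Resize to model-friendly dimensions only because the model needs
    them, run the model, then restore the exact original size. Sizes
    stand in for images here; the real nodes operate on actual images."""
    h, w = size
    working = (snap_down_to_multiple(h), snap_down_to_multiple(w))
    run_model(working)  # the model sees multiple-of-8 dimensions
    return (h, w)       # the output is always the exact input size

seen = []
out = run_filter((1001, 599), seen.append)
print(seen[0])  # (1000, 592) -- what the model ran on
print(out)      # (1001, 599) -- matches the input exactly
```

The key property is the last line: whatever resizing happens internally, the caller always gets back the original dimensions.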

There are no changes to the existing nodes, so existing workflows will not break. That said, we may want to add a new value to the Classification enum used by nodes so that we can deprecate the existing nodes. Eventually we can drop the controlnet_aux dependency entirely.
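A possible shape for that enum value (a hypothetical sketch; the member names of Invoke's actual Classification enum are assumptions here and may differ):

```python
from enum import Enum

class Classification(str, Enum):
    """Illustrative sketch of a node classification enum with a new
    Deprecated member."""
    Stable = "stable"
    Beta = "beta"
    Prototype = "prototype"
    Deprecated = "deprecated"  # proposed: hidden from the node library

def visible_in_node_library(c: Classification) -> bool:
    # Deprecated nodes would stay loadable for existing workflows but
    # would not be offered when adding new nodes.
    return c is not Classification.Deprecated
```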

Related Issues / Discussions

Discord & offline discussion

QA Instructions

I did a few rounds of testing:

  • With an input image with multiple of 8 width/height
  • With an input image with odd number width/height
  • Side-by-side with the other versions of the nodes
  • With all permutations of settings (not comparing outputs - just making sure they did something and didn't break anything)

In all cases, the outputs were identical where possible (the controlnet_aux resizing logic makes this impossible in some situations).

Merge Plan

Once this merges, I'll update all of the linear UI to use the new nodes. Maybe there are some default workflows to update also?

Checklist

  • The PR has a short but descriptive title, suitable for a changelog
  • Tests added / updated (if applicable)
  • Documentation added / updated (if applicable)

@github-actions bot added the python, Root, invocations, backend, and python-deps labels on Sep 10, 2024
@psychedelicious
Collaborator Author

Note for reviewers: The vast majority of this PR is copy-pasted code from controlnet_aux (nearly all of which was itself copied from some other source).

Review should focus on:

  • The new invocations
  • backend/image_util/*.py
  • The __init__.py files in the newly-added folders in backend/image_util/, which contain the revised model loading and execution classes for the nodes

@hipsterusername
Member

A deprecation class should be instituted. We probably want to de-clutter the node library sooner rather than later; it will be confusing for users otherwise.

Or, maybe, deprecated nodes don't show up in the node library and are only visible on old workflows?

Similar to the existing node, but without any resizing. The backend logic was consolidated and modified so that model loading can be managed by the model manager.

The ONNX Runtime `InferenceSession` class was added to the `AnyModel` union to satisfy the type checker.
Use a generic to narrow the `type` field from `string` to a literal. Now you can do e.g. `adapter.type === 'control_layer_adapter'` and TS narrows the type.
They will still be usable if a workflow uses one. You just cannot add them directly.
It's a line segment detector, not a general edge detector.
- Add backcompat for cnet model default settings
- Default filter selection based on model type
- Updated UI components to use new filter nodes
- Added handling for failed filter executions, preventing filter from getting stuck in case it failed for some reason
- New translations for all filters & fields
@github-actions bot added the frontend label on Sep 11, 2024
@psychedelicious
Collaborator Author

  • Added DW Openpose filter
  • Renamed some of the cryptic filter settings from ML research abbreviations to human words
  • Added Classification.Deprecated, applied it to the old cnet processor nodes, and hid them from the UI (they will still work if an existing workflow that uses them is loaded)
  • Fixed a couple of bugs in the MLSD detector
  • Fixed a race condition with the progress bar/queue count that tended to happen when executing filters (which execute really fast)
  • Updated the UI to use the new filters
  • Default filter is now linked to the control model
  • Added a filter button next to the control model dropdown
  • Control & raster layers can have an image dropped on them to replace the layer's content
  • Added a "pull bbox into" button for control layers, global IP adapters, and regional IP adapters

@hipsterusername hipsterusername merged commit 88dcb38 into main Sep 11, 2024
14 checks passed
@hipsterusername hipsterusername deleted the psyche/feat/migrate-cnet-nodes branch September 11, 2024 12:12