
Onnx Pipeline: Inference for text to image conversion #3380

Closed

Conversation

@saikrishna2893 commented May 10, 2023

Initial version of an ONNX-based inference pipeline for text-to-image conversion, based on Stable Diffusion models.
Performance information: tested on Raphael (AMD 7600X) and Raptor Lake (i5-13000K).
ONNX inference performance: InvokeAI-pipelines.pdf

Sample output from execution of the ONNX pipeline on Raptor Lake (i5-13000K), CPU:
[image]

Sample output from execution of the PyTorch pipeline on Raptor Lake (i5-13000K), CPU:
[image]

@hipsterusername (Member)

@saikrishna2893 - Thanks for the contribution!

To confirm, is this integrated into the new Nodes pipelines that are being worked on in Main?

@lstein (Collaborator) left a comment

I really appreciate the work that went into this.

I'm sad to say that this will have to be modified in order to work with nodes. In particular, CLI.py is going to disappear from the repository soon. Please take a look at the invokeai/app tree, in particular invokeai/app/invocation/latents.py, to understand how the new text-to-image inference system works.
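For readers unfamiliar with the nodes system: an invocation is a pydantic model with an `invoke()` method that the graph executor calls. The sketch below shows only the general shape, assuming the baseinvocation module layout in the invokeai/app tree; the class name, fields, and body are hypothetical, not taken from latents.py:

```python
# Illustrative sketch of the node/invocation pattern, not actual InvokeAI
# source; OnnxTextToImageInvocation and its fields are hypothetical names.
from pydantic import Field

from invokeai.app.invocations.baseinvocation import (
    BaseInvocation,
    InvocationContext,
)
from invokeai.app.invocations.image import ImageOutput


class OnnxTextToImageInvocation(BaseInvocation):
    """Hypothetical ONNX text-to-image node."""

    type: str = "onnx_txt2img"
    prompt: str = Field(default="", description="Text prompt")
    steps: int = Field(default=30, description="Number of denoising steps")

    def invoke(self, context: InvocationContext) -> ImageOutput:
        # Run the ONNX pipeline here, hand the resulting image to the
        # app's image service, and wrap the result in an ImageOutput.
        raise NotImplementedError
```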

@lstein (Collaborator) commented May 10, 2023

> Initial version of an ONNX-based inference pipeline for text-to-image conversion, based on Stable Diffusion models. Performance information: tested on Raphael (AMD 7600X) and Raptor Lake (i5-13000K). ONNX inference performance: InvokeAI-pipelines.pdf
>
> Sample output from execution of the ONNX pipeline on Raptor Lake (i5-13000K): [image]
>
> Sample output from execution of the PyTorch pipeline on Raptor Lake (i5-13000K): [image]

Does the ONNX pipeline take advantage of CUDA, and if so, how does it perform?

@lstein (Collaborator) commented May 10, 2023

Also note the CI failures.

@saikrishna2893 (Author)

> Does the ONNX pipeline take advantage of CUDA, and if so, how does it perform?

The ONNX pipeline currently uses the CPU as its device. The pipeline makes use of the OpenVINO execution provider for optimized inference.
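For context, execution-provider selection in onnxruntime looks roughly like the sketch below. This is illustrative, not code from this PR: the model path is a placeholder, and the OpenVINO provider is only available in the onnxruntime-openvino build.

```python
# Illustrative sketch, not code from this PR: selecting execution
# providers in onnxruntime. "unet.onnx" is a placeholder path; the
# runtime falls back to later providers in the list if one is missing.
import onnxruntime as ort

session = ort.InferenceSession(
    "unet.onnx",
    providers=["OpenVINOExecutionProvider", "CPUExecutionProvider"],
)
print(session.get_providers())  # shows which providers actually loaded

# With the onnxruntime-gpu build, CUDA would be requested the same way:
# providers=["CUDAExecutionProvider", "CPUExecutionProvider"]
```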

@saikrishna2893 (Author) commented May 16, 2023

> I really appreciate the work that went into this.
>
> I'm sad to say that this will have to be modified in order to work with nodes. In particular, CLI.py is going to disappear from the repository soon. Please take a look at the invokeai/app tree, in particular invokeai/app/invocation/latents.py, to understand how the new text-to-image inference system works.

@lstein, can you point us to any documentation on using the app and node structure, along with example commands to run and test? We have done a code walkthrough of invokeai/app, but some of its workings are still unclear. We have checked PR #3180, the description on the discussion page, and other PRs, and have seen some commands that use pipes to create multiple inference sessions. Any further information on this would be helpful. Thanks.

We faced errors when running the following commands:

  1. `txt2img --prompt "an old man reading newspaper"`: produced the error quoted below
  2. `t2l --prompt "an old man reading newspaper" | l2i`: Warning --> Invalid command.
```
(test_Invoekai) C:\Users\user1\Invokeai\InvokeAI>python scripts\invoke-new.py
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> Patchmatch initialized
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> Initializing, be patient...
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> Initialization file C:\Users\mcw\invokeai\invokeai.init found. Loading...
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> InvokeAI, version 3.0.0+a0
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> InvokeAI runtime directory is "C:\Users\mcw\invokeai"
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> Model manager initialized
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> GFPGAN Initialized
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> CodeFormer Initialized
[16-05-2023 03:48:08]::[InvokeAI]::INFO --> Face restoration initialized
invoke> txt2img --prompt "an old man reading newspaper"
[16-05-2023 03:48:14]::[InvokeAI]::INFO --> Loading diffusers model from runwayml/stable-diffusion-v1-5
[16-05-2023 03:48:14]::[InvokeAI]::DEBUG --> Using more accurate float32 precision
[16-05-2023 03:48:14]::[InvokeAI]::DEBUG --> Loading diffusers VAE from stabilityai/sd-vae-ft-mse
[16-05-2023 03:48:14]::[InvokeAI]::DEBUG --> Using more accurate float32 precision
[16-05-2023 03:48:16]::[InvokeAI]::DEBUG --> Default image dimensions = 512 x 512
[16-05-2023 03:48:16]::[InvokeAI]::INFO --> Loading embeddings from C:\Users\mcw\Documents\InvokeAI_org\invokeai\embeddings
[16-05-2023 03:48:16]::[InvokeAI]::INFO --> Textual inversion triggers:
[16-05-2023 03:48:16]::[InvokeAI]::INFO --> Model loaded in 2.13s
Generating:   0%|                                                                                                                                | 0/1 [00:00<?, ?it/s]
[16-05-2023 03:48:17]::[InvokeAI]::ERROR --> Error in node fe27d5da-b317-43e1-851e-5ec112abcdb8 (source node 0): Traceback (most recent call last):
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\app\services\processor.py", line 70, in __process
    outputs = invocation.invoke(
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\app\invocations\generate.py", line 92, in invoke
    generate_output = next(outputs)
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\backend\generator\base.py", line 144, in generate
    results = generator.generate(prompt,
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\backend\generator\base.py", line 374, in generate
    image = make_image(x_T, seed)
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\backend\generator\txt2img.py", line 65, in make_image
    pipeline_output = pipeline.image_from_embeddings(
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\backend\stable_diffusion\diffusers_pipeline.py", line 480, in image_from_embeddings
    result_latents, result_attention_map_saver = self.latents_from_embeddings(
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\backend\stable_diffusion\diffusers_pipeline.py", line 523, in latents_from_embeddings
    result: PipelineIntermediateState = infer_latents_from_embeddings(
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\backend\stable_diffusion\diffusers_pipeline.py", line 207, in __call__
    callback(result)
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\app\invocations\generate.py", line 66, in dispatch_progress
    stable_diffusion_step_callback(
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\app\util\step_callback.py", line 40, in stable_diffusion_step_callback
    image = Generator.sample_to_lowres_estimated_image(sample)
  File "C:\Users\user1\Invokeai\InvokeAI\invokeai\backend\generator\base.py", line 514, in sample_to_lowres_estimated_image
    latent_image = samples[0].permute(1, 2, 0) @ v1_5_latent_rgb_factors
RuntimeError: "addmm_impl_cpu_" not implemented for 'Half'

[16-05-2023 03:48:17]::[InvokeAI]::WARNING --> Session error: creating a new session
```
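A side note on that final RuntimeError: PyTorch builds of that era had no float16 matmul kernel on CPU, so the low-res preview projection fails when the latents are half precision. A minimal reproduction with stand-in tensors (not InvokeAI code), plus the usual float32 cast that avoids it:

```python
import torch

# Stand-in tensors, shaped like SD latents and the latent-to-RGB
# projection matrix; these are illustrative, not values from InvokeAI.
sample = torch.randn(4, 64, 64, dtype=torch.float16)
rgb_factors = torch.randn(4, 3, dtype=torch.float16)

# On PyTorch versions without CPU half-precision matmul support,
# this reproduces the error in the traceback above:
#   sample.permute(1, 2, 0) @ rgb_factors
#   -> RuntimeError: "addmm_impl_cpu_" not implemented for 'Half'

# Casting to float32 before the matmul sidesteps it:
latent_image = sample.permute(1, 2, 0).float() @ rgb_factors.float()
print(latent_image.shape)  # torch.Size([64, 64, 3])
```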

@hipsterusername (Member)

Hello! Following up on the request from discussions with @lalith-mcw here - CCing @lstein and @StAlKeR7779 for visibility.

To confirm, is there a reason you're looking for CLI documentation? I ask because 3.0 supports a graph-based API that can be accessed via its OpenAPI documentation.

It may be easier to get direct implementation support if you join Discord. That is where we offer live dev feedback and Q&A, and where a number of folks find implementation guidance.

In any case, you'll likely need the following guidance, which I believe @lstein and/or @StAlKeR7779 can provide more details on:

  • Using the new Model Manager (Model Manager rewrite #3335) to incorporate the ONNX model configuration.
  • Updating node logic in latents.py or in the diffusers pipeline to support the new model format.

If you reach out to me on Discord, I can create a channel for us to discuss this project. Thanks again for your and the team's support!

@lalith-mcw

> To confirm, is there a reason you're looking for CLI documentation? I ask because 3.0 supports a graph-based API that can be accessed via its OpenAPI documentation.

Currently in this PR we provide an option for the user to select the model type, torch or onnx. This was implemented in args.py and cli.py under the previous format, and we will try to integrate the same with the graph-based node invocations. I'll start a chat on Discord, thanks.
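For reference, the backend selection described above could be exposed along these lines. This is a hypothetical sketch: the `--model_type` flag name is assumed, not taken from this PR's args.py.

```python
# Hypothetical sketch of a torch/onnx backend switch; the flag name
# --model_type is assumed, not taken from this PR's args.py.
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--model_type",
    choices=["torch", "onnx"],
    default="torch",
    help="Inference backend to use for text-to-image generation",
)
args = parser.parse_args()
print(f"Selected backend: {args.model_type}")
```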

@psychedelicious (Collaborator)

Hi @saikrishna2893, we have implemented ONNX support in #3562. It is integrated into the nodes backend, but support is limited to text-to-image only for now.
