GitHub - DenOfEquity/PixArt-Sigma-for-webUI: PixArt Sigma in Forge webUI for Stable Diffusion (and Alpha, LCM, DMD)

PixArt Sigma (and Alpha) for webui

Forge tested, probably A1111 too

I don't think there is anything Forge specific here.

works for me ^TM on 8Gb VRAM, 16Gb RAM (GTX1070)

Install

Go to the Extensions tab, then Install from URL, use the URL for this repository.

screenshot

current UI

downloads models on demand - minimum will be ~20Gb

needs updated diffusers

Alpha needs updated transformers and tokenizers

As of 27/05/2024, diffusers is offically up to version 0.28.0, so installing a beta is no longer necessary.

Easiest way to ensure necessary versions are installed is to edit requirements.text and requirements_versions.txt in the webUI folder. Otherwise starting the webUI will undo the tokenizers upgrade.

diffusers>=0.28.1
tokenizers>=0.19
transformers==4.40

These upgrades didn't break anything for me.

At your own risk. This is ~~barely~~ ~~moderately~~ somewhat tested, ~~and even then~~ but only on my computer. Models will be downloaded automatically, on demand (so if you never generate with the 256 model, it'll never be downloaded). The T5 text encoder is around 18Gb and the image models are about 2.3Gb each. The models include the float32 T5 model, which I convert to fp16 on first use and save the result for all future runs. This conversion and saving is automatic. Once done, the full size float32 model could be deleted to reclaim some space.

Note that Alpha and Sigma both use the same T5 text encoder, my caching system means it should only be downloaded once regardless for how many models are used.

I can generate using all models, though the 2K Sigma model is very slow on my limited hardware. It's undertrained anyway, at present.

25/06/2024

added option to caption using Florence-2, in image to image section. 'P' button toggles overwriting prompt, results always written to console.
minor code improvements

17/06/2024

tweaked logic for model identification. Added new finetune, seems likely to continue to be updated, ptx0/pixart-sigma to models list. Info

16/06/2024

settings to colourize the initial noise. This is essentially free extra control. Leave strength at 0.0 to bypass it. Doesn't seem as effective with PixArt as with other models, but still something to experiment with.

09/06/2024

added modelsListPA.py in extension directory - edit in any text editor to hide models you don't care about or add new models (must be huggingface diffusers style checkpoints)
added button to toggle resolution binning, enabled by default and generally best left enabled IMO. I was testing disabling it for potential hires fix type uses, with some limited success around x1.5 upscale. x2 was messy. Maybe repeated smaller upscales could work, but probably better to send to img2img and use an sdXL checkpoint to upscale.

07/06/2024

added flash diffusion. This is a distilled Lora (211MB) on top of Alpha1024: 4 steps, low CFG (1-2), seems much better than LCM. Forced LCM scheduler.

05/06/2024

small update to work with diffusers >= 0.28.1 : Transformer2DModel is now PixArtTransformer2DModel

27/05/2024

override sampler for LCM and DMD in right place, a bit earlier than before
added saving of DMD step in infotext when using DMD model; also don't save CFG and step count for DMD (would be redundant)

25/05/2024

avoid error message when Set Width/Height from Image without an image loaded

07/05/2024

various tweaks. The K button enables karras sigmas for schedulers/samplers, which could be worth experimenting with.
img2img

04/05/2024

UI reshuffle. again
made using OpenAI consistency decoder VAE optional (previously DMD always used it, and other Alpha models never did, Sigma models can't) - it's VRAM greedy (for me only usable on 512 models) and not always better.

03/04/2024

added SA-solver, DPM SDE
forced default sampler for LCM, DMD models
forced empty negative, 1 step, 1 CFG for DMD
cache VAE to share between models. Sharing is default behaviour for PixArt-Sigma, but the Alpha models each had their own copy. Now it'll only be downloaded the first time.
added LCM, DMD
- ~~note: DMD doesn't install correctly - repository is missing tokenizer directory. Seems like the symlinks (follow the Snapshot folder) can simply be copied from another Alpha model.~~ This shouldn't be an issue anymore, as I always fetch tokenizer from the same place rather than from each model.
simple optimisation: if prompts not changed, text_encoder stage is skipped
styles, ~~really should be in own file for easy editing~~

02/05/2024

support for PixArt-Alpha models - they use the same T5 text encoder.
fixed ~~(but still commented out)~~ saving of fp16 text encoder. Previously might not have saved in the right place.

01/05/2024

samplers
captions in gallery (where linebreaks?)
correct seeds for batches

to do

prompt: portrait photograph, woman with red hair, wearing green blazer over yellow tshirt and blue trousers, on sunny beach with dark clouds on horizon

Thanks to:

frutiemax92 for inference_pipeline.py
benjamin-bertram for ui details

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
scripts		scripts
README.md		README.md
customStylesList.py		customStylesList.py
example.png		example.png
modelsListPA.py		modelsListPA.py
screenshot.png		screenshot.png
screenshot2.png		screenshot2.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PixArt Sigma (and Alpha) for webui

Forge tested, probably A1111 too

works for me ^TM on 8Gb VRAM, 16Gb RAM (GTX1070)

Install

screenshot

downloads models on demand - minimum will be ~20Gb

needs updated diffusers

Alpha needs updated transformers and tokenizers

25/06/2024

17/06/2024

16/06/2024

09/06/2024

07/06/2024

05/06/2024

27/05/2024

25/05/2024

07/05/2024

04/05/2024

03/04/2024

02/05/2024

01/05/2024

to do

About

Releases

Packages

Languages

DenOfEquity/PixArt-Sigma-for-webUI

Folders and files

Latest commit

History

Repository files navigation

PixArt Sigma (and Alpha) for webui

Forge tested, probably A1111 too

works for me TM on 8Gb VRAM, 16Gb RAM (GTX1070)

Install

screenshot

downloads models on demand - minimum will be ~20Gb

needs updated diffusers

Alpha needs updated transformers and tokenizers

25/06/2024

17/06/2024

16/06/2024

09/06/2024

07/06/2024

05/06/2024

27/05/2024

25/05/2024

07/05/2024

04/05/2024

03/04/2024

02/05/2024

01/05/2024

to do

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

works for me ^TM on 8Gb VRAM, 16Gb RAM (GTX1070)

Packages