UniPC for diffusion sampling #2684

nicksenger · 2024-12-27T22:30:56Z

Hi, thanks for the awesome library! It's great having these tools available in the Rust ecosystem.

I was interested in low step-count inference for some experiments, so ported over the UniPC scheduler. I figured I'd open a PR here in case this work is useful to others. Here is a comparison with Euler A and DDIM at 5 steps which demonstrates UniPC's benefits for quick convergence:

LaurentMazare · 2024-12-30T07:33:27Z

Would you have some examples with good quality where this makes a difference?

nicksenger · 2024-12-30T23:36:28Z

Sure, here is a comparison of DDIM (default cfg) and UniPC (corrector enabled past step 2, Bh2 solver type, otherwise defaults) on sd1.5 across 50 steps with the following prompt with seed 1984 and guidance scale 15:

a rusty robot holding a fire torch in its hand,android scrapyard,oxidized,intricate exposed wires,arcing,neon lights,sci-fi,dystopia,futuristic city background,silhouettes,strange illuminated mannequins,geisha billboard,flying cars,4k,cinematic lighting,photo-realistic,extremely detailed,high quality,epic,lfg

negative prompt:

reduction,reducing agents,electron rich,drawing,illustrated,low quality,out of focus,blurry,cut off,simple,clean,organized,daytime,utopia,peaceful,low contrast,lowres,worst quality

6 steps is about where the output starts becoming clear from both schedulers, but the composition is quite different:

At 50 steps both have converged to a similar output with some differences. Which one looks better is a bit subjective, but I think most would agree the composition is closer to the 6-step output from UniPC than that from DDIM:

Just for fun here's the 50-step output from UniPC with the same settings as above but using 3 for the solver order:

Somewhat different output again, but overall the composition pretty much resembles what it produces at 5 steps:

candle-transformers/src/models/stable_diffusion/uni_pc.rs

LaurentMazare · 2024-12-31T04:25:40Z

candle-transformers/src/models/stable_diffusion/uni_pc.rs

+    }
+
+    #[derive(Clone, Copy)]
+    struct FloatOrd(f64);


Could you give some insights on the ordering that this provides?

I added a comment in fedce6287405fe04826344c3b5537375b317760a, the ordering is:

NaN | -Infinity | x < 0 | -0 | +0 | x > 0 | +Infinity | NaN

It's the same strategy used by float-ord, which is in turn a dependency of the average crate where this quantile computation comes from. These are only used for the dynamic thresholding, so I thought it'd be better to hide the logic within this module versus introducing additional dependencies crate-wide.

Agreed that it's better not to introduce a dependency but why not use the total ordering f64::total_cmp? Do you expect some differences with it that would be helpful here?

Ah, nope. This was just an oversight on my part, I thought the msrv for the project was lower for some reason, but now I'm seeing total_cmp is used in quite a few other places. It looks like the standard lib uses the same method: https://doc.rust-lang.org/src/core/num/f64.rs.html#1350

Updated in f0384fa

candle-transformers/src/models/stable_diffusion/uni_pc.rs

LaurentMazare · 2024-12-31T04:30:35Z

Thanks for the detailed analysis, I've put a bunch of comments inline, please also look at the clippy and rustfmt failures in the CI.

nicksenger · 2024-12-31T19:24:38Z

Thanks for the detailed analysis, I've put a bunch of comments inline, please also look at the clippy and rustfmt failures in the CI.

Thanks, these should be addressed now.

LaurentMazare · 2025-01-01T20:34:26Z

Thanks!

super-fun-surf · 2025-01-02T19:51:25Z

stoked!

feat: Add unipc multistep scheduler

28a11db

LaurentMazare reviewed Dec 31, 2024

View reviewed changes

nicksenger added 6 commits December 31, 2024 09:26

chore: Clippy and formatting

eada6a3

chore: Update comments

fedce62

chore: Avoid unsafety in float ordering

b95f2f5

refactor: Update Scheduler::step mutability requirements

73e9bc9

fix: Corrector img2img

7c279b5

chore: Update unipc ref link to latest diffusers release

2204ef8

nicksenger added 2 commits January 1, 2025 16:24

chore: Deduplicate float ordering

f0384fa

fix: Panic when running with dev profile

c003ab2

LaurentMazare merged commit cbaa0ad into huggingface:main Jan 1, 2025
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UniPC for diffusion sampling #2684

UniPC for diffusion sampling #2684

nicksenger commented Dec 27, 2024

LaurentMazare commented Dec 30, 2024

nicksenger commented Dec 30, 2024

LaurentMazare Dec 31, 2024

nicksenger Dec 31, 2024

LaurentMazare Jan 1, 2025

nicksenger Jan 1, 2025

LaurentMazare commented Dec 31, 2024

nicksenger commented Dec 31, 2024

LaurentMazare commented Jan 1, 2025

super-fun-surf commented Jan 2, 2025

UniPC for diffusion sampling #2684

UniPC for diffusion sampling #2684

Conversation

nicksenger commented Dec 27, 2024

LaurentMazare commented Dec 30, 2024

nicksenger commented Dec 30, 2024

LaurentMazare Dec 31, 2024

Choose a reason for hiding this comment

nicksenger Dec 31, 2024

Choose a reason for hiding this comment

LaurentMazare Jan 1, 2025

Choose a reason for hiding this comment

nicksenger Jan 1, 2025

Choose a reason for hiding this comment

LaurentMazare commented Dec 31, 2024

nicksenger commented Dec 31, 2024

LaurentMazare commented Jan 1, 2025

super-fun-surf commented Jan 2, 2025