
Possible fix of wrong scale in weight decomposition #16151

Merged · 1 commit from dora-fix into dev on Jul 6, 2024
Conversation

@KohakuBlueleaf (Collaborator)

Description

Should resolve this: comfyanonymous/ComfyUI#3922


@ntc-ai commented Jul 5, 2024

Hi, this patch is wrong.

The alpha variable is used by model authors to normalize the trained LoRA/DoRA to the -1 to 1 range. With this patch, changing alpha breaks the DoRA instead of normalizing it.

More: KohakuBlueleaf/LyCORIS#196

@KohakuBlueleaf (Collaborator, Author) commented Jul 5, 2024

It breaks because that is not how alpha works.
I'm maintaining the correct math formula, not the behaviour you want.
You can create an extension to achieve your goal.

@ntc-ai commented Jul 5, 2024

What is this useful for?
Are there references to the correct math formula?
There is no alpha in the DoRA whitepaper.

@KohakuBlueleaf (Collaborator, Author) commented Jul 5, 2024

> What is this useful for? Are there references to the correct math formula? There is no alpha in the DoRA whitepaper.

Wrong formula: W + scale * alpha/dim * (wd(W + BA) - W)
Correct formula: W + scale * (wd(W + alpha/dim * BA) - W)

I think it is quite intuitive that alpha/dim should not affect the "- W" part.
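For illustration, a minimal sketch of the two merge orders in PyTorch. The names (wd, dora_scale, merge_old, merge_new) and the norm axis are assumptions for this sketch, not the actual webui or LyCORIS code:

```python
import torch

def wd(w, dora_scale):
    # Weight decomposition: keep w's direction, rescale by the learned
    # magnitude vector. The norm axis is assumed here; implementations differ.
    return dora_scale * (w / w.norm(dim=1, keepdim=True))

def merge_old(w, up, down, dora_scale, alpha, dim, strength=1.0):
    # Previous behaviour: alpha/dim scales the whole decomposed delta,
    # including the "- W" part.
    return w + strength * (alpha / dim) * (wd(w + up @ down, dora_scale) - w)

def merge_new(w, up, down, dora_scale, alpha, dim, strength=1.0):
    # This PR: alpha/dim scales only the BA term before decomposition,
    # so the "- W" part is left untouched.
    return w + strength * (wd(w + (alpha / dim) * (up @ down), dora_scale) - w)
```

With alpha = 0, merge_old reduces to W (the whole delta is multiplied by zero), whereas merge_new reduces to W + strength * (wd(W) - W), which is generally nonzero; that difference is what the rest of the thread argues about.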

@KohakuBlueleaf (Collaborator, Author)

Also, that's how DoRA training works in my repo. I'm just reproducing the same formula here.

@AUTOMATIC1111 merged commit 372a8e0 into dev on Jul 6, 2024 (6 checks passed)
@AUTOMATIC1111 deleted the dora-fix branch on Jul 6, 2024 at 06:48
@ntc-ai commented Jul 6, 2024

Hi, this PR should be reverted. Here are the reasons why:

  • With this PR, changing alpha breaks the model, even setting it to zero. This is different from how LoRAs work, where alpha=0 turns the LoRA off.
  • There is no alpha in the whitepaper.
  • This PR breaks existing trained DoRAs whose alpha was modified based on the original code.
  • alpha is not trainable, and (alpha/dim) is 1.0 in most cases.
  • I brought attention to an issue in the referenced implementation, not A1111. This code was correct before the changes.

@KohakuBlueleaf (Collaborator, Author) commented Jul 6, 2024

> Hi, this PR should be reverted. Here are the reasons why:
>   • With this PR, changing alpha breaks the model, even setting it to zero. This is different from how LoRAs work, where alpha=0 turns the LoRA off.
>   • There is no alpha in the whitepaper.
>   • This PR breaks existing trained DoRAs whose alpha was modified based on the original code.
>   • alpha is not trainable, and (alpha/dim) is 1.0 in most cases.
>   • I brought attention to an issue in the referenced implementation, not A1111. This code was correct before the changes.
  1. That's not how alpha works. DoRA never guarantees that the weights are unchanged when the BA part is all zeros.
  2. You should never modify alpha after training, for any model. If you want to do any special weight scaling, do it with your own custom code and equation.
  3. This code was NOT correct before the changes. I already gave you the formula.

@ntc-ai commented Jul 7, 2024

  1. This code has been live for 4 months. The change you're implementing breaks functionality for anyone who has been using it during this time.

  2. You are making a breaking change without anyone requesting it - in fact I'm requesting the opposite!

  3. There's no external reference or evidence supporting your 'correct' formula. It seems to be based solely on your own intuition.

  4. Several important points, and evidence that the prior solution is correct, remain unaddressed.

  5. In this formulation, alpha has no use. The best case I can see is gradient scaling, but there are better ways to do that.

I hope you can understand the impact of these changes on users.

If you have the 'special weight scaling' code, feel free to share it. I don't think it's possible in this formulation.
