
[LoRA] introduce LoraBaseMixin to promote reusability. #8774

Merged: 59 commits merged into main from feat-lora-base-class on Jul 25, 2024

Conversation

@sayakpaul (Member) commented Jul 3, 2024

What does this PR do?

It is basically a mirror of #8670, which I accidentally merged and then reverted in #8773. Apologies for that.

Check #8774 (comment) as well.

I have left inline comments to address the questions raised by @yiyixuxu.

@sayakpaul requested reviews from DN6 and yiyixuxu, July 3, 2024 01:37
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@DN6 (Collaborator) commented Jul 3, 2024

Nice initiative 👍🏽. There's a lot to unpack here, so perhaps it's best to start bit by bit. For now I've just gone over the pipeline-related components.

Regarding the LoraBaseMixin, at the moment I think it might be doing a bit too much.

There are quite a few methods in there that make assumptions about the inheriting class, which isn't really how a base class should behave. Loading methods tied to specific model components, e.g. load_lora_into_text_encoder, are better left out. If such a method is used across different pipelines with no changes, it's better to create a utility function that does this and call it from the inheriting class, or to redefine the method in the inheriting class and use the # Copied from convention.
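For illustration, here is a minimal, self-contained sketch of the "module-level utility called from the inheriting class" pattern; all names and signatures below are hypothetical stand-ins, not the actual diffusers APIs.

from typing import Any, Dict


def _load_lora_into_text_encoder(state_dict: Dict[str, Any], text_encoder: Any, adapter_name: str = "default") -> None:
    # In the real library this would inject LoRA layers via PEFT; here we only
    # mark where the shared, component-specific logic would live.
    print(f"injecting {len(state_dict)} tensors into {text_encoder!r} as adapter '{adapter_name}'")


class LoraBaseMixin:
    # Only component-agnostic plumbing (state-dict fetching, offload handling, ...) lives here.
    pass


class StableDiffusionLoraLoaderMixin(LoraBaseMixin):
    text_encoder = "text_encoder_stub"

    def load_lora_weights(self, state_dict: Dict[str, Any], adapter_name: str = "default") -> None:
        # The inheriting class knows its own components and calls the shared utility.
        _load_lora_into_text_encoder(state_dict, self.text_encoder, adapter_name=adapter_name)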

I would assume that these are the methods that need to be defined for managing LoRAs across all pipelines?

from typing import Callable, Dict

import torch


class LoraBaseMixin:

    @classmethod
    def _optionally_disable_offloading(cls, _pipeline):
        raise NotImplementedError()

    @classmethod
    def _fetch_state_dict(
        cls,
        pretrained_model_name_or_path_or_dict,
        weight_name,
        use_safetensors,
        local_files_only,
        cache_dir,
        force_download,
        resume_download,
        proxies,
        token,
        revision,
        subfolder,
        user_agent,
        allow_pickle,
    ):
        raise NotImplementedError()

    @classmethod
    def _best_guess_weight_name(
        cls, pretrained_model_name_or_path_or_dict, file_extension=".safetensors", local_files_only=False
    ):
        raise NotImplementedError()

    @classmethod
    def save_lora_weights(cls, **kwargs):
        raise NotImplementedError("`save_lora_weights()` not implemented.")

    @classmethod
    def lora_state_dict(cls, **kwargs):
        raise NotImplementedError("`lora_state_dict()` is not implemented.")

    def load_lora_weights(self, **kwargs):
        raise NotImplementedError("`load_lora_weights()` is not implemented.")

    def unload_lora_weights(self, **kwargs):
        raise NotImplementedError("`unload_lora_weights()` is not implemented.")

    def fuse_lora(self, **kwargs):
        raise NotImplementedError("`fuse_lora()` is not implemented.")

    def unfuse_lora(self, **kwargs):
        raise NotImplementedError("`unfuse_lora()` is not implemented.")

    def disable_lora(self):
        raise NotImplementedError("`disable_lora()` is not implemented.")

    def enable_lora(self):
        raise NotImplementedError("`enable_lora()` is not implemented.")

    def get_active_adapters(self):
        raise NotImplementedError("`get_active_adapters()` is not implemented.")

    def delete_adapters(self, adapter_names):
        raise NotImplementedError("`delete_adapters()` is not implemented.")

    def set_lora_device(self, adapter_names):
        raise NotImplementedError("`set_lora_device()` is not implemented.")

    @staticmethod
    def pack_weights(layers, prefix):
        raise NotImplementedError()

    @staticmethod
    def write_lora_layers(
        state_dict: Dict[str, torch.Tensor],
        save_directory: str,
        is_main_process: bool,
        weight_name: str,
        save_function: Callable,
        safe_serialization: bool,
    ):
        raise NotImplementedError()

    @property
    def lora_scale(self) -> float:
        raise NotImplementedError()

Quite a few of these methods probably cannot be defined in the base class, such as load_lora_weights/unload_lora_weights and fuse_lora/unfuse_lora, since they deal with specific pipeline components. They might also require arguments specific to the pipeline type or its components.

I think it might be better to define these methods in a pipeline-specific class that inherits from LoraBaseMixin, or just as its own mixin class. I don't have a strong feeling about either approach. e.g. StableDiffusionLoraLoaderMixin could look like:

from typing import Dict, Optional, Union

import torch
from transformers import PreTrainedModel

from diffusers import ModelMixin


class StableDiffusionLoraLoaderMixin(LoraBaseMixin):
    _lora_loadable_modules = ["unet", "text_encoder"]

    def load_lora_weights(
        self,
        pretrained_model_name_or_path_or_dict: Union[str, Dict[str, torch.Tensor]],
        adapter_name: Optional[str] = None,
        **kwargs,
    ):
        _load_lora_into_unet(**kwargs)
        _load_lora_into_text_encoder(**kwargs)

    def fuse_lora(self, components=["unet", "text_encoder"], **kwargs):
        for fuse_component in components:
            if fuse_component not in self._lora_loadable_modules:
                raise ValueError(f"{fuse_component} is not in self._lora_loadable_modules.")

            model = getattr(self, fuse_component)
            # check whether it is a diffusers model
            if isinstance(model, ModelMixin):
                model.fuse_lora()
            # handle transformers models
            if isinstance(model, PreTrainedModel):
                fuse_text_encoder()

I saw this comment about using the term "fuse_denoiser" in the fusing methods. I'm not so sure about that. If we want to fuse the LoRA in a specific component, I think it's better to pass in the actual name of the component used in the pipeline, rather than track another attribute such as denoiser.

I also think the constants and class attributes such as TEXT_ENCODER_NAME and is_unet_denoiser might not be needed if we use a single class attribute with a list of the names of the LoRA-loadable components.
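As a rough, hedged sketch of that idea (the class and attribute names here are illustrative only, not the final API), a single _lora_loadable_modules list would let the base class iterate generically over whatever components a concrete pipeline declares:

class LoraBaseMixin:
    _lora_loadable_modules: list = []

    def disable_lora(self):
        # Generic iteration: no TEXT_ENCODER_NAME constant or is_unet_denoiser flag needed.
        for name in self._lora_loadable_modules:
            component = getattr(self, name, None)
            if component is not None:
                print(f"disabling LoRA on '{name}'")


class StableDiffusionPipelineStub(LoraBaseMixin):
    _lora_loadable_modules = ["unet", "text_encoder"]
    unet = object()
    text_encoder = object()


StableDiffusionPipelineStub().disable_lora()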

@sayakpaul (Member, Author)

@DN6 as discussed over Slack, I have unified the PeftAdapterMixin class too so that we can have methods like fuse_lora(), delete_lora(), enable_lora(), etc. under one umbrella without having to define and copy-paste them for each model-specific loader mixin such as UNet2DConditionLoadersMixin.

One thing to note is that I still had to keep loaders/transformer_sd3.py to implement set_adapters(), since this method varies between the UNet and the transformer because their block naming is different. This is also why you will see set_adapters() in UNet2DConditionLoadersMixin.

We could have two additional classes under loaders/peft.py:

  • TransformerPeftAdapterMixin(PeftAdapterMixin)
  • UNet2DConditionPeftAdapterMixin(PeftAdapterMixin)

and reimplement this method there, using them accordingly (a rough sketch follows below).

LMK.
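For reference, a minimal stand-alone sketch of what those two classes could look like; the set_adapters() bodies are illustrative placeholders, not the actual implementations:

class PeftAdapterMixin:
    def set_adapters(self, adapter_names, weights=None):
        raise NotImplementedError("`set_adapters()` must be specialized per model type.")


class UNet2DConditionPeftAdapterMixin(PeftAdapterMixin):
    def set_adapters(self, adapter_names, weights=None):
        # UNet-style block naming (down_blocks / mid_block / up_blocks).
        print(f"setting adapters {adapter_names} using UNet block naming")


class TransformerPeftAdapterMixin(PeftAdapterMixin):
    def set_adapters(self, adapter_names, weights=None):
        # DiT-style block naming (transformer_blocks).
        print(f"setting adapters {adapter_names} using transformer block naming")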

@sayakpaul (Member, Author)

@DN6 I think this is ready for another review now.

weights = [w if w is not None else 1.0 for w in weights]

# e.g. [{...}, 7] -> [{expanded dict...}, 7]
scale_expansion_fn = _SET_ADAPTER_SCALE_FN_MAPPING[self.__class__.__name__]
@DN6 (Collaborator) commented on this snippet, Jul 23, 2024


Let's just add a check in case this is applied to a model that doesn't exist in the mapping. It's an edge case, because we would probably always verify, but better to be safe:

if scale_expansion_fn is not None:
    ...

@sayakpaul (Member, Author)

But scale_expansion_fn cannot be None, no? We are directly indexing the dictionary here rather than using get(), so a wrong key will raise an error anyway. But LMK if I am missing something.

@DN6 (Collaborator)

Actually on second thought, the check might be overkill. If we add to a model not in the mapping, we should error out.

@DN6 (Collaborator), Jul 24, 2024

What I was thinking is that SD3Transformer2DModel doesn't even need to be in the mapping: we could use .get() to check whether a scale_expansion_fn exists for a model class and return None if it doesn't. Either approach works.
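For reference, a stand-alone illustration of the two options being weighed here; the mapping contents and model names are made up for the example:

_SET_ADAPTER_SCALE_FN_MAPPING = {
    "SD3Transformer2DModel": lambda model, weights: weights,  # placeholder expansion fn
}

model_cls_name = "SomeNewTransformerModel"  # hypothetical model not in the mapping

# Option A: direct indexing -- an unmapped model fails loudly with a KeyError.
try:
    scale_expansion_fn = _SET_ADAPTER_SCALE_FN_MAPPING[model_cls_name]
except KeyError:
    print(f"{model_cls_name} has no scale expansion fn registered")

# Option B: .get() -- returns None for unmapped models, letting the caller simply skip expansion.
scale_expansion_fn = _SET_ADAPTER_SCALE_FN_MAPPING.get(model_cls_name)
if scale_expansion_fn is not None:
    weights = scale_expansion_fn(None, [1.0])

The reply below settles on the first option, since direct indexing already errors out for models that were never registered.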

@sayakpaul (Member, Author)

"If we add to a model not in the mapping, we should error out."

Yeah, this already works. So, I would prefer that.

@sayakpaul requested a review from DN6, July 23, 2024 15:03
@sayakpaul (Member, Author)

@DN6 anything else you would like me to address?

@DN6 (Collaborator) left a comment:

LGTM 👍🏽

@sayakpaul merged commit 527430d into main, Jul 25, 2024
18 checks passed
@sayakpaul deleted the feat-lora-base-class branch, July 25, 2024 16:11
@sayakpaul (Member, Author)

Thanks for the massive help and guidance, Dhruv!

yiyixuxu added a commit that referenced this pull request, Jul 25, 2024:
Revert "[LoRA] introduce LoraBaseMixin to promote reusability. (#8774)"

This reverts commit 527430d.
@yiyixuxu restored the feat-lora-base-class branch, July 25, 2024 19:12
@sayakpaul deleted the feat-lora-base-class branch, July 26, 2024 01:40
@sayakpaul restored the feat-lora-base-class branch, July 26, 2024 01:40