Implement options for selecting layers to apply LoRA to during training #163

Open · wants to merge 1 commit into main
Conversation

viktorhargitai

Implements a feature for specifying which layers of the model LoRA should be applied to during training. This enables further reductions in training memory use, as well as additional QLoRA training experiments (including, but not limited to, reproducing the relevant ablation experiments from the QLoRA paper).

Layer selection is controlled by the lora_modules string argument (a sketch of one possible resolver follows the list below). Its value can be:

  • a regex pattern for exact matching of arbitrary layer names;
  • 'all' (the default), which selects all linear transformer-block layers (i.e. identical behavior to the previous commit);
  • 'attention', which selects only the attention layers;
  • 'ffn', which selects only the feed-forward layers.
    (The latter two aim to reproduce ablation experiments from the paper, but also work for e.g. Falcon models, not just LLaMA.)
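
As a rough illustration, a resolver for these options might look like the sketch below. The function name resolve_lora_modules and the suffix tuples are assumptions for illustration, not identifiers from the actual commit; in the real QLoRA code path the type check would presumably target the quantized bitsandbytes linear class rather than torch.nn.Linear.

```python
import re

import torch.nn as nn

# Illustrative module-name suffixes; real models may use different names.
ATTENTION_SUFFIXES = (
    "q_proj", "k_proj", "v_proj", "o_proj",  # LLaMA-style attention projections
    "query_key_value", "dense",              # Falcon-style attention projections
)
FFN_SUFFIXES = (
    "gate_proj", "up_proj", "down_proj",     # LLaMA-style feed-forward layers
    "dense_h_to_4h", "dense_4h_to_h",        # Falcon-style feed-forward layers
)


def resolve_lora_modules(model: nn.Module, lora_modules: str = "all") -> list[str]:
    """Return the module names that LoRA adapters should be attached to."""
    targets = set()
    for name, module in model.named_modules():
        if not isinstance(module, nn.Linear):  # QLoRA would check the 4-bit linear class here
            continue
        suffix = name.split(".")[-1]
        if suffix == "lm_head":  # the output head is conventionally excluded from LoRA
            continue
        if lora_modules == "all":
            targets.add(suffix)
        elif lora_modules == "attention" and suffix in ATTENTION_SUFFIXES:
            targets.add(suffix)
        elif lora_modules == "ffn" and suffix in FFN_SUFFIXES:
            targets.add(suffix)
        elif lora_modules not in ("all", "attention", "ffn") and re.fullmatch(lora_modules, name):
            targets.add(name)  # regex patterns match against the full module path
    return sorted(targets)
```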

The "lora_modules" argument was already present in the guanaco training scripts, but it was ignored: this repo only implemented applying LoRA to all linear layers.
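
Hooking the selection into a training script could then look roughly like the following, reusing the hypothetical resolve_lora_modules sketch above; the model id and hyperparameter values are placeholders, not values taken from the patch.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Placeholder model; the guanaco scripts would supply their own.
model = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b")

# Restrict LoRA to the attention layers only.
target_modules = resolve_lora_modules(model, lora_modules="attention")
peft_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=target_modules,
)
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()
```

Note that PEFT's LoraConfig also accepts a plain string for target_modules and treats it as a regex, so the regex option could presumably be passed straight through instead of being expanded into a list first.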

viktorhargitai changed the title from "Implement layer selection options for applying LoRA to during training" to "Implement options for selecting layers to apply LoRA to during training" on Jun 11, 2023