After successfully training, how would I use the LLaMA-based model in Hugging Face? I pushed the contents of the `lora_models` folder, which I labeled uniquely, but it is apparently missing the base model needed to use it with the Inference API.

Replies: 2 comments
-
To my knowledge, the Inference API does not support adapter models. You might need to merge the LoRA adapter into the base model and push the merged weights instead:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load the base model, then apply the LoRA adapter on top of it.
model = AutoModelForCausalLM.from_pretrained(
    base_model_name_or_path, device_map='auto')
model = PeftModel.from_pretrained(
    model,
    lora_model_name_or_path,
    device_map='auto',
)

# Fold the adapter weights into the base model so the result is a
# plain transformers checkpoint. Needs peft>=0.3.0.
model = model.merge_and_unload()
model.push_to_hub(model_name)

# For the Inference API to work, we need to push the tokenizer too.
tokenizer = AutoTokenizer.from_pretrained(base_model_name_or_path)
tokenizer.push_to_hub(model_name)
```
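Once the merged model and tokenizer are on the Hub, the hosted Inference API can be queried over plain HTTP. A minimal sketch, assuming a hypothetical repo id `your-username/your-merged-model` and an access token in the `HF_TOKEN` environment variable:

```python
import os
import requests

# Hypothetical repo id; substitute the name you pushed the merged model to.
model_name = "your-username/your-merged-model"
api_url = f"https://api-inference.huggingface.co/models/{model_name}"
headers = {"Authorization": f"Bearer {os.environ['HF_TOKEN']}"}

# Send a text-generation request to the hosted Inference API.
response = requests.post(
    api_url,
    headers=headers,
    json={"inputs": "Once upon a time", "parameters": {"max_new_tokens": 50}},
)
print(response.json())
```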
-
This is very helpful. Thank you!