Replies: 2 comments
-
To my knowledge, the inference API does not support adapter models. You might need to merge the LoRA adapter into the base model and push the merged weights instead:

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# base_model_name_or_path, lora_model_name_or_path, and model_name are
# placeholders for your base model, your adapter repo, and the target repo id.
model = AutoModelForCausalLM.from_pretrained(
    base_model_name_or_path, device_map='auto')
model = PeftModel.from_pretrained(
    model,
    lora_model_name_or_path,
    device_map='auto'
)
model = model.merge_and_unload()  # Needs peft>=0.3.0
model.push_to_hub(model_name)
# For the inference API to work, we need to push the tokenizer too.
tokenizer = AutoTokenizer.from_pretrained(base_model_name_or_path)
tokenizer.push_to_hub(model_name)
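Once the merged model is on the Hub, here is a minimal sketch of querying it through the hosted Inference API, assuming a recent huggingface_hub; "your-username/model_name" and the token are hypothetical placeholders:

from huggingface_hub import InferenceClient

# Hypothetical repo id and token; substitute your own.
client = InferenceClient(model="your-username/model_name", token="hf_...")
print(client.text_generation("Tell me about llamas.", max_new_tokens=50))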
-
This is very helpful. Thank you!
-
After successfully training, how would I use the LLaMA-based model in HuggingFace? I pushed the contents of the lora_models folder, which I uniquely labeled, but it is apparently missing the base model needed for the inference API to work.