huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
https://huggingface.co/docs/peft

merge_and_unload docs do not clarify behaviour for quantized base models #2105

Open RonanKMcGovern opened 3 days ago

RonanKMcGovern commented 3 days ago

System Info

NA

Who can help?

@BenjaminBossan could you add a note to the docs explaining the default behaviour of `merge_and_unload` when the base model is quantized, and also any workarounds for best performance (e.g. loading the base model dequantized, loading the adapter on top of it, and then merging)? Thanks

https://huggingface.co/docs/peft/main/en/package_reference/lora#peft.LoraModel.merge_and_unload
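For reference, a minimal sketch of the workaround described above, assuming a LoRA adapter (e.g. trained with QLoRA on a quantized base) that you want to merge into a full-precision copy of the base model; the model id and adapter path below are placeholders:

```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

base_id = "meta-llama/Llama-2-7b-hf"  # placeholder base model id
adapter_id = "path/to/lora-adapter"   # placeholder adapter path

# Load the base model in (half) precision rather than quantized form,
# so the LoRA deltas can be merged into unquantized weights.
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.float16)

# Attach the trained LoRA adapter to the full-precision base.
model = PeftModel.from_pretrained(base, adapter_id)

# Merge the adapter weights into the base weights and drop the PEFT wrappers.
merged = model.merge_and_unload()
merged.save_pretrained("merged-model")
```

Merging into dequantized weights avoids the rounding error that merging into quantized weights would otherwise introduce; the merged model can then be re-quantized afterwards if needed.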

Reproduction

NA

Expected behavior

NA

BenjaminBossan commented 3 days ago

I agree that the information is a bit sparse. Could you expand on what exactly you would like to see? What is the workaround that you mentioned? Do you mean quantization methods that don't support merging? And what type of performance do you have in mind?