artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs
https://arxiv.org/abs/2305.14314
MIT License
9.74k stars 800 forks source link

Qlora Read me fix #272

Open Vezora-Corp opened 9 months ago

Vezora-Corp commented 9 months ago

The read me qlora template gets memory error with default optimizer. Using "--optim adamw_bnb_8bit" fixes the error as for quant models. Updated read me: README.md