artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs
https://arxiv.org/abs/2305.14314
MIT License

Quantization aware finetuning? #273

Open SinanAkkoyun opened 1 year ago

SinanAkkoyun commented 1 year ago

Hi! Is it possible to finetune with quantization in mind, i.e. quantization-aware training? https://www.tensorflow.org/model_optimization/guide/quantization/training

This way one could hopefully reduce quantization errors even further.
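For context, the core idea behind quantization-aware training is to simulate quantization in the forward pass while letting gradients flow to the full-precision weights via a straight-through estimator. Below is a minimal PyTorch sketch of that idea (this is my own illustration, not code from this repo, and `fake_quantize` / `QATLinear` are hypothetical names; QLoRA itself instead freezes NF4-quantized base weights and trains LoRA adapters):

```python
import torch

def fake_quantize(w: torch.Tensor, num_bits: int = 4) -> torch.Tensor:
    # Symmetric uniform fake-quantization with a straight-through estimator:
    # the forward pass sees quantized weights, but the backward pass treats
    # the rounding as identity, so gradients update the fp32 weights.
    qmax = 2 ** (num_bits - 1) - 1            # e.g. 7 for 4-bit symmetric
    scale = w.abs().max() / qmax
    w_q = torch.clamp(torch.round(w / scale), -qmax, qmax) * scale
    return w + (w_q - w).detach()             # STE: forward w_q, backward identity

class QATLinear(torch.nn.Linear):
    # A Linear layer that trains against its own quantization error.
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.nn.functional.linear(x, fake_quantize(self.weight), self.bias)
```

With layers like this, the optimizer sees the loss computed through the quantized weights, so training can partially compensate for rounding error before the final quantized export.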