artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs
https://arxiv.org/abs/2305.14314
MIT License
10.06k stars 822 forks source link

Question about deployment of fine tuned model #289

Open Brandon371 opened 8 months ago

Brandon371 commented 8 months ago

I have used Qlora to fine tune the LLM model. I have seen some models on output folder. How can I run the output model and test them? I have tried to use some open source API. It seems requiring config.json to deploy this model on my server. How can I deploy the fine tuned model locally?