Open jayteaftw opened 9 months ago
@jayteaftw Agree with you
Might be super helpful
Mark. I am adding this feature.
Hello, has this problem been fixed yet? Right now, when I need to update a LoRA model, I have to stop the server, add the new LoRA model, and start the server again. What a stupid operation! If a LoRA model could be added to a running server, the problem would be solved.
Hi,
I was reading through the documentation for using LoRA in vLLM.
In the documentation, when the server is started, it looks like you have to specify up front which LoRA modules are available:
--lora-modules sql-lora=~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/
Is it possible to do this at runtime instead? That is, start the server and then call a recently added LoRA module without having to stop and restart the server?
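
For context, here is a minimal sketch of how a module registered at startup is typically called. It assumes the server exposes an OpenAI-compatible `/v1/completions` endpoint at `http://localhost:8000` and that the name given to `--lora-modules` (`sql-lora` here) is passed as the `model` field. The helper name `build_completion_request`, the prompt, and the `max_tokens` value are illustrative, not part of vLLM. It only covers calling an adapter that was already registered at startup; it does not load a new one at runtime.

```python
import json
import urllib.request


def build_completion_request(base_url, lora_name, prompt, max_tokens=32):
    """Build a POST request that targets a LoRA module by its registered name.

    Assumption: the server routes on the "model" field, so the name given
    to --lora-modules at startup (e.g. "sql-lora") is used here.
    """
    payload = {"model": lora_name, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical host and port; requires a running server.
    req = build_completion_request(
        "http://localhost:8000", "sql-lora", "San Francisco is a"
    )
    with urllib.request.urlopen(req) as resp:
        print(json.load(resp))
```

With this setup, a request naming a module that was not listed at startup would be expected to fail, which is the limitation the question is about.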