runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
MIT License
213 stars 82 forks source link

Multi-LoRA #60

Open joaomsimoes opened 5 months ago

joaomsimoes commented 5 months ago

Any update on when this feature will be available? Thanks