InftyAI / llmaz

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
Apache License 2.0

LoRA multiplexing support #27

Open kerthcet opened 4 months ago

kerthcet commented 4 months ago

/kind feature
/kind enhancement

kerthcet commented 3 months ago

/milestone v0.2.0

kerthcet commented 3 months ago

/kind api-change

kerthcet commented 3 months ago

Some community docs: https://docs.google.com/document/d/1sFNHQqUWm1DIzC9GxXp3cKRm8cUtTcGuwZYkjkOkUqk/edit#heading=h.9brozdsx9dqo