InftyAI / llmaz

☸️ Easy, advanced inference platform for large language models on Kubernetes
Apache License 2.0
13 stars 5 forks source link

Lora multiplexing support #27

Open kerthcet opened 1 month ago

kerthcet commented 1 month ago

/kind feature /kind enhancement

kerthcet commented 1 month ago

/milestone v0.2.0

kerthcet commented 1 month ago

/kind api-change

kerthcet commented 1 month ago

Some community docs: https://docs.google.com/document/d/1sFNHQqUWm1DIzC9GxXp3cKRm8cUtTcGuwZYkjkOkUqk/edit#heading=h.9brozdsx9dqo