anarchy-ai / LLM-VM

irresponsible innovation. Try now at https://chat.dev/
https://anarchy.ai/
MIT License
465 stars 150 forks source link

Load-balancing / auto-scaling for LLM serving on Google Cloud #379

Open Aryan8912 opened 8 months ago

Aryan8912 commented 8 months ago

close #375

mmirman commented 8 months ago

This appears to also include a commit from #378 and #374, please clean the commit tree before creating PRs.