substratusai / lingo

Lightweight ML model proxy and autoscaler for kubernetes
https://www.substratus.ai
Apache License 2.0
96 stars 6 forks source link

add flash attention in vLLM helm chart #99

Closed samos123 closed 2 months ago