runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
MIT License
242 stars 97 forks source link

[Update] Docs, bug fix. #109

Closed pandyamarut closed 2 months ago