runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
MIT License
222 stars 86 forks source link

Update README.md #90

Closed pandyamarut closed 1 month ago