runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
MIT License
213 stars 82 forks source link

A new version of VLLM has been released #84

Open d4rk6un opened 1 month ago

d4rk6un commented 1 month ago

A new version(0.5.1) of VLLM has been released, could you please update it to work with runpod serverless?

https://github.com/vllm-project/vllm/releases

nerdylive123 commented 1 month ago

Duplicate issue with: this #83