runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
MIT License
213 stars 82 forks source link

Update to vllm 0.5 #80

Closed Sapessii closed 3 weeks ago

Sapessii commented 1 month ago

Hi

Can you please update the version to use VLLM v0.5 please?

Thank you