runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
MIT License
220 stars 85 forks

Update for vllm 0.2.0 #9

Closed kenny019 closed 10 months ago

kenny019 commented 11 months ago

vLLM released a new version. Could we get this updated to 0.2.0? https://github.com/vllm-project/vllm/releases/tag/v0.2.0