runpod-workers / worker-vllm

The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
MIT License
220 stars, 85 forks

Chat Template Feature, Message List, Small Refactor #27

Closed: alpayariyak closed this 8 months ago