ray-project / ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://ray.io
Apache License 2.0

[DOC] vLLM example does not work #45739

Open vincent-pli opened 1 month ago

vincent-pli commented 1 month ago

Description

I tried to get the vLLM example working with the latest vLLM version (v0.4.3). Following the current example from https://docs.ray.io/en/master/serve/tutorials/vllm-example.html, I got this exception:

AttributeError: 'list' object has no attribute 'max_model_len'

This is caused by a missing parameter in: https://github.com/ray-project/ray/blob/c4a87ee474041ab7286a41378f3f6db904e0e3c5/doc/source/serve/doc_code/vllm_openai_example.py#L53

OpenAIServingChat requires a ModelConfig as its second parameter, so the list of served model names passed in that position is treated as the model config.
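
To illustrate the failure mode, here is a minimal sketch that reproduces the same error message without importing vLLM. The classes `ModelConfig` and `ServingChat` and the helpers `old_call_error` and `fixed_call` are hypothetical stand-ins written for this comment, not vLLM's real classes. The sketch assumes only that the constructor reads `max_model_len` from its second argument, which is what the traceback suggests.

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    """Hypothetical stand-in for a model config; only one field is modeled."""
    max_model_len: int


class ServingChat:
    """Hypothetical stand-in for OpenAIServingChat, not the real class."""

    def __init__(self, engine, model_config, served_model_names, response_role,
                 lora_modules=None, chat_template=None):
        # Assumption: the constructor reads max_model_len from its second argument.
        self.max_model_len = model_config.max_model_len
        self.served_model_names = served_model_names
        self.response_role = response_role


def old_call_error():
    """Call the constructor the old way (no model config) and return the error text."""
    engine = object()  # placeholder; the engine is never used in this sketch
    served_model_names = ["my-model"]
    try:
        # The list lands in the model_config slot because one argument is missing.
        ServingChat(engine, served_model_names, "assistant", None, None)
    except AttributeError as exc:
        return str(exc)
    return None


def fixed_call():
    """Call the constructor with a model config in the second position."""
    engine = object()
    return ServingChat(engine, ModelConfig(max_model_len=4096), ["my-model"], "assistant")


print(old_call_error())
print(fixed_call().max_model_len)
```

The first call fails because every positional argument shifts one slot to the left. The second call succeeds once a config object sits in the second position. How the real example should obtain its model config is not covered by this sketch.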

I will open a PR to fix it later.

Link

https://docs.ray.io/en/master/serve/tutorials/vllm-example.html

shivam-dubey-1 commented 3 weeks ago

Any updates on the fix?