Closed FennFlyer closed 4 months ago
~Does the model fail upon startup? Otherwise, can you provide an example OpenAI API request that triggers this error?~
Can you try out #5214 and see if you get the same problem? The profile_run
logic should be fixed there.
Sure, do you have a recommended way to build the container? Just do the usual clone and Docker build on the branch or does your team have any build magic happening that I need to know about? Right now I'm just pulling straight from Docker Hub.
Sorry I missed this - I haven't used the Docker container myself, but from my understanding, you can use the Dockerfile from the main branch directly.
v0.5.1
has been released so you can directly use the official Docker image now.
Thank you, I was out on holiday last week so I will test the new image ASAP!
Your current environment
Current Environment
Docker image:
vllm/vllm-openai:v0.5.0.post1
Running as part of a Docker Compose stack. Relevant sections of my
docker-compose.yaml
are below. This is part of a multi-model deployment with other vLLM-based text generation/chat models running successfully behind a Traefik reverse proxy. I split out the instance running LLaVa 1.6 into its own service in thedocker-compose.yaml
to test the different commands it requires passed in on startup, it is the third service in the file. I have included the .env file entries as well.VLLM_IMAGE_MODEL_ID
points to a cloned Huggingface directory from https://huggingface.co/llava-hf/llava-v1.6-mistral-7b-hf (withtemplate_llava.jinja
added) that has directory structure:🐛 Describe the bug
Bug description
On starting the service with
docker compose --env-file .env.llava up reverseproxy vllm-llava-server
, it appears to do the usual startup, but then throws aValueError
, see below for full text and STDOUT. I have included all startup values that appear to be required when instantiating a newLLM
object from https://github.com/vllm-project/vllm/blob/main/examples/llava_example.py, am I missing something from mycommand
entry in thedocker-compose.yaml
?