Open DimadonDL opened 5 days ago
Hi @DimadonDL,
The main difference with your curl
request appears to be the lack of setting max_tokens
. I believe the current default max_tokens is 16 (example from vllm) if left unspecified, which is probably why you're getting such a short response. Can you try setting max_tokens
?
MODEL="llama-3.1-8b-instruct"
curl -s http://localhost:9000/v1/chat/completions -H 'Content-Type: application/json' -d '{
"model": "'${MODEL}'",
"messages": [{"role": "user", "content": "Why is the sky blue?"}],
"max_tokens": 4096
}' | jq
Hi @rmccorm4,
Thanks for the explanation. That makes sense. Is it possible to override the default in vllm. I have an application where I can not set the max_token.
Thanks for your help so far. ☺️👌
Hi,
i have installed Triton with vllm backend and also the OpenAI-Compatible Frontend for Triton Inference Server (Beta). The model is meta-llama/Llama-3.1-8B-Instruct. Now when I call the Endpoint for example like this:
The Response is:
As you can see the content is cropped. I have played with the config but I don't know what's the problem. With Python the response is fine:
The response here is:
The sky appears blue due to a phenomenon called Rayleigh scattering. This is a scientific explanation:
Sunlight and the Atmosphere: When sunlight enters the Earth's atmosphere, it encounters tiny molecules of gases such as nitrogen (N2) and oxygen (O2). These molecules are much smaller than the wavelength of light.
Scattering of Light: According to the Rayleigh scattering theory, when light travels through the atmosphere, it encounters these tiny molecules. The shorter (blue) wavelengths of light are scattered more than the longer (red) wavelengths. This scattering of light in all directions is what gives the sky its blue color.
Blue Light Dominates: Due to the scattering effect, the blue light is distributed throughout the atmosphere, reaching our eyes from all directions. As a result, the sky appears blue. This is why we see a blue sky during the daytime.
Time of Day and Atmospheric Conditions: The color of the sky can change depending on the time of day and atmospheric conditions. During sunrise and sunset, the light has to travel longer distances through the atmosphere, which scatters the shorter wavelengths even more, making the sky appear red or orange. On a cloudy day, the scattered light is blocked, making the sky appear gray or white.
In summary, the sky appears blue due to the scattering of sunlight by the tiny molecules in the atmosphere, with blue light being scattered more than other colors.
My model.json is:
I have the same problem with 4096 max_model_len.
It would be great if someone can help me here.
Hardware: 4 GPUS NVIDIA L4 with 96 GB VRAM.
Thanks 👍