vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://docs.vllm.ai
Apache License 2.0

[Performance]: underperforming in comparison with SGLang #7108

Open meetzuber opened 1 month ago

meetzuber commented 1 month ago

Proposal to improve performance

vLLM is underperforming in comparison with SGLang. Something needs optimization for better performance.

Report of performance regression

https://lmsys.org/blog/2024-07-25-sglang-llama3/
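The numbers in that post can be sanity-checked locally. Below is a minimal sketch that measures offline generation throughput with vLLM's Python API; the model name, prompt set, and sampling parameters are placeholders, and a fair comparison would run the equivalent workload through SGLang (the blog's serving-side numbers come from an online benchmark against a running server, which this does not reproduce).

```python
import time

from vllm import LLM, SamplingParams

# Placeholder model; the linked post benchmarks Llama 3 variants.
llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")

# A batch of identical prompts gives a rough throughput number;
# a real benchmark should vary prompt and output lengths.
prompts = ["Explain the role of the KV cache in LLM serving."] * 64
params = SamplingParams(temperature=0.8, max_tokens=128)

start = time.perf_counter()
outputs = llm.generate(prompts, params)
elapsed = time.perf_counter() - start

generated = sum(len(o.outputs[0].token_ids) for o in outputs)
print(f"{generated / elapsed:.1f} output tokens/s over {len(prompts)} requests")
```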

Misc discussion on performance

No response

Your current environment (if you think it is necessary)

The output of `python collect_env.py`

felixzhu555 commented 1 month ago

See #6801!