vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://docs.vllm.ai
Apache License 2.0
27.8k stars 4.1k forks source link

[Bug]: CTRL+C Not Killing Process with distributed_executor_backend=mp in VLLM v0.5.3.post1 #7554

Open hahmad2008 opened 1 month ago

hahmad2008 commented 1 month ago

Your current environment

🐛 Describe the bug

When using distributed_executor_backend=mp with VLLM version vllm==0.5.3.post1, the process does not respond to CTRL+C in the terminal to terminate the process. This issue did not occur with vllm==0.5.0.post1.

  1. With tensor parallelism >1, the log shows that one process exits but another process on a different GPU continues to run. The terminal is unresponsive to CTRL+C signals in this case. (VllmWorkerProcess pid=9486) INFO 08-15 11:37:35 multiproc_worker_utils.py:237] Worker exiting

  2. With tensor parallelism ==1, the terminal does not respond to the CTRL+C signal at all.

youkaichao commented 1 month ago

cc @njhill

KuntaiDu commented 1 month ago

Temporary fix: try kill vllm by these lines of command:

ps aux | grep python | grep openai | awk '{print $2}' | xargs -r kill -9
ps -e | grep pt_main_thread | awk '{print $1}' | xargs kill -9