Open zxdvd opened 4 months ago
> And when trying to debug it with export VLLM_TRACE_FUNCTION=1, it didn't crash. After unset it, it crashed again.
That's very strange ...
> Python version: 3.11.0rc1 (main, Aug 12 2022, 10:02:14) [GCC 11.2.0] (64-bit runtime)
One thing I notice is that you are using a release candidate version of Python. Have you tried switching to a different Python version, e.g. an official release of Python 3.10 or 3.11?
OK, I'll try with the Python 3.11 release. The current 3.11 rc1 comes from the image nvidia/cuda:12.1.0-devel-ubuntu22.04.
I switched to Python 3.10 and could not reproduce the crash.
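Since the crash seems tied to the interpreter build, it can help to confirm which Python is actually running inside the container before comparing results. The snippet below is only a minimal sketch using the standard library; `is_prerelease` is a hypothetical helper name and is not part of vLLM.

```python
import sys


def is_prerelease(version_info=sys.version_info):
    """Return True for alpha, beta, or release candidate interpreters.

    sys.version_info.releaselevel is one of 'alpha', 'beta',
    'candidate', or 'final'.
    """
    return version_info.releaselevel != "final"


# Print the full version string, e.g. the "3.11.0rc1 (main, ...)" line
# reported in the environment dump.
print(sys.version)
if is_prerelease():
    print("warning: this interpreter is a pre-release build")
```

Running this in both the `nvidia/cuda` image and the Python 3.10 environment would show whether the two setups really differ in release level.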
I encountered this issue in Python 3.9 as well.
From my testing, it appears to be random; sometimes it core dumps, and sometimes it doesn't.
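For an intermittent crash like this, one option besides inspecting the core file is Python's built-in `faulthandler` module, which prints the Python-level traceback of every thread when the process receives a fatal signal such as SIGSEGV. The sketch below assumes you can add a few lines at the top of the reproduction script; `enable_crash_tracebacks` and the log path are hypothetical names chosen for illustration. Whether the resulting traceback pinpoints the faulty call in this particular crash is not known.

```python
import faulthandler
import sys


def enable_crash_tracebacks(log_path=None):
    """Dump Python tracebacks for all threads on a fatal signal.

    Writes to log_path if given, otherwise to stderr. The returned
    file object must stay open for as long as faulthandler is enabled.
    """
    stream = open(log_path, "w") if log_path else sys.stderr
    faulthandler.enable(file=stream, all_threads=True)
    return stream
```

Calling `enable_crash_tracebacks("crash.log")` before loading the model keeps the traceback even when stderr is lost. The same behaviour can be switched on without code changes by running `python -X faulthandler script.py` or by setting `PYTHONFAULTHANDLER=1` in the environment.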
This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!
Your current environment
🐛 Describe the bug
vLLM is compiled from the latest source code (commit af9ad46fca). It works without problems for other models such as `opt-125m`, but always crashes with DeepSeek Coder V2 Lite.
When I tried to debug it with `export VLLM_TRACE_FUNCTION=1`, it didn't crash. After unsetting the variable, it crashed again.
I tried to debug the core file with `gdb python <THE_CORE_FILE>` but didn't find anything useful. Can anyone help me get more information from the core file?
The following code reproduces the crash.