InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
https://lmdeploy.readthedocs.io/en/latest/
Apache License 2.0

[Bug] Service hangs / becomes unresponsive after serving requests for a while #2001

Open hezeli123 opened 4 months ago

hezeli123 commented 4 months ago

Checklist

Describe the bug

Running the qwenvl-chat model on a single A100 with a session-len of 32k, the service stops responding after it has handled requests for a while.

Reproduction

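1. lmdeploy serve api_server Qwen-VL-Chat --server-port 80 --session-len 32768
2. Run a client against the server for a while; the service then stops responding. If the client is synchronous and sets no timeout, it blocks indefinitely and never receives a response.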
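To keep a synchronous client from blocking forever once the server hangs, the request can carry an explicit timeout. The following is only a minimal sketch, not the client used in this report: it assumes the server exposes the OpenAI-style `/v1/chat/completions` route, and the helper name `chat_once`, the model name, and the default timeout are all hypothetical choices for illustration.

```python
import json
import socket
import urllib.error
import urllib.request


def chat_once(base_url, prompt, model="Qwen-VL-Chat", timeout_s=30.0):
    """POST one chat request; return parsed JSON, or None if it timed out.

    `model` and the route below are assumptions; adjust them to the
    names your server actually reports.
    """
    payload = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    req = urllib.request.Request(
        base_url.rstrip("/") + "/v1/chat/completions",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    try:
        # timeout_s bounds connect and each blocking socket read.
        with urllib.request.urlopen(req, timeout=timeout_s) as resp:
            return json.loads(resp.read().decode("utf-8"))
    except (socket.timeout, TimeoutError):
        # Read timeout: the server accepted the request but never answered.
        return None
    except urllib.error.URLError as exc:
        # Connect-phase timeouts arrive wrapped in URLError.
        if isinstance(exc.reason, (socket.timeout, TimeoutError)):
            return None
        raise
```

A call such as `chat_once("http://127.0.0.1:80", "hello", timeout_s=60)` then returns `None` instead of hanging, which also makes it easy to log the moment at which the server stopped answering.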

Environment

Single A100
lmdeploy check_env
/usr/local/lib/python3.10/dist-packages/gradio_client/documentation.py:103: UserWarning: Could not get documentation group for <class 'gradio.mix.Parallel'>: No known documentation group for module 'gradio.mix'
  warnings.warn(f"Could not get documentation group for {cls}: {exc}")
/usr/local/lib/python3.10/dist-packages/gradio_client/documentation.py:103: UserWarning: Could not get documentation group for <class 'gradio.mix.Series'>: No known documentation group for module 'gradio.mix'
  warnings.warn(f"Could not get documentation group for {cls}: {exc}")
sys.platform: linux
Python: 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0]
CUDA available: True
MUSA available: False
numpy_random_seed: 2147483648
GPU 0: NVIDIA A100-SXM4-80GB
CUDA_HOME: /usr/local/cuda
NVCC: Cuda compilation tools, release 12.3, V12.3.107
GCC: x86_64-linux-gnu-gcc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0
PyTorch: 2.2.2+cu121
PyTorch compiling details: PyTorch built with:
  - GCC 9.3
  - C++ Version: 201703
  - Intel(R) oneAPI Math Kernel Library Version 2022.2-Product Build 20220804 for Intel(R) 64 architecture applications
  - Intel(R) MKL-DNN v3.3.2 (Git Hash 2dc95a2ad0841e29db8b22fbccaf3e5da7992b01)
  - OpenMP 201511 (a.k.a. OpenMP 4.5)
  - LAPACK is enabled (usually provided by MKL)
  - NNPACK is enabled
  - CPU capability usage: AVX512
  - CUDA Runtime 12.1
  - NVCC architecture flags: -gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_86,code=sm_86;-gencode;arch=compute_90,code=sm_90
  - CuDNN 8.9.2
  - Magma 2.6.1
  - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, CUDA_VERSION=12.1, CUDNN_VERSION=8.9.2, CXX_COMPILER=/opt/rh/devtoolset-9/root/usr/bin/c++, CXX_FLAGS= -D_GLIBCXX_USE_CXX11_ABI=0 -fabi-version=11 -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOROCTRACER -DUSE_FBGEMM -DUSE_QNNPACK -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-type-limits-Wno-array-bounds -Wno-unknown-pragmas -Wno-unused-parameter -Wno-unused-function -Wno-unused-result -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=pedantic -Wno-error=old-style-cast -Wno-missing-braces -fdiagnostics-color=always -faligned-new -Wno-unused-but-set-variable -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, PERF_WITH_AVX512=1, TORCH_VERSION=2.2.2, USE_CUDA=ON, USE_CUDNN=ON, USE_EXCEPTION_PTR=1, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=1, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=OFF, USE_ROCM_KERNEL_ASSERT=OFF,

TorchVision: 0.17.2+cu121
LMDeploy: 0.4.2+
transformers: 4.40.2
gradio: 3.40.1
fastapi: 0.110.2
pydantic: 2.7.1
triton: 2.2.0

Error traceback

#0  0x00007f8a5a52fb68 in __GI_epoll_pwait (epfd=epfd@entry=42, events=events@entry=0x7fffb8baaba0, maxevents=maxevents@entry=1024, 
    timeout=timeout@entry=100, set=set@entry=0x0) at ../sysdeps/unix/sysv/linux/epoll_pwait.c:40
#1  0x00007f896cd2f23d in uv__io_poll (loop=0x557fda8f8ee0, timeout=100) at src/unix/linux.c:1359
#2  0x00007f896cd21ea6 in uv_run (loop=0x557fda8f8ee0, mode=UV_RUN_DEFAULT) at src/unix/core.c:447
#3  0x00007f896ccada83 in __pyx_f_6uvloop_4loop_4Loop___run (__pyx_v_self=0x557f04badca0, __pyx_v_mode=UV_RUN_DEFAULT)
    at uvloop/loop.c:15092
#4  0x00007f896ccb6b16 in __pyx_f_6uvloop_4loop_4Loop__run (__pyx_v_self=0x557f04badca0, __pyx_v_mode=UV_RUN_DEFAULT)
    at uvloop/loop.c:15471
#5  0x00007f896cc7c3c3 in __pyx_pf_6uvloop_4loop_4Loop_24run_forever (__pyx_v_self=0x557f04badca0) at uvloop/loop.c:28166
#6  __pyx_pw_6uvloop_4loop_4Loop_25run_forever (__pyx_v_self=0x557f04badca0, unused=<optimized out>) at uvloop/loop.c:27987
#7  0x00007f896cc48355 in __Pyx_PyObject_CallMethO (arg=0x0, func=0x7f899d838860) at uvloop/loop.c:171467
#8  __Pyx_PyObject_CallNoArg (func=0x7f899d838860) at uvloop/loop.c:40421
#9  0x00007f896cce128d in __pyx_pf_6uvloop_4loop_4Loop_44run_until_complete (__pyx_v_future=0x7f896ce38e10, 
    __pyx_v_self=<optimized out>) at uvloop/loop.c:30035
#10 __pyx_pw_6uvloop_4loop_4Loop_45run_until_complete (__pyx_v_self=<optimized out>, __pyx_v_future=<optimized out>)
    at uvloop/loop.c:29633
#11 0x0000557ebff992ca in ?? ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7bcfa in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#16 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#17 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#18 0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#19 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#20 0x0000557ebff9c492 in PyObject_Call ()
#21 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#22 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#23 0x0000557ebff7bcfa in _PyEval_EvalFrameDefault ()
#24 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#25 0x0000557ebff7626d in _PyEval_EvalFrameDefault ()
#26 0x0000557ebff729c6 in ?? ()
#27 0x0000557ec0068256 in PyEval_EvalCode ()
#28 0x0000557ec0093108 in ?? ()
#29 0x0000557ec008c9cb in ?? ()
#30 0x0000557ec0092e55 in ?? ()
#31 0x0000557ec0092338 in _PyRun_SimpleFileObject ()
#32 0x0000557ec0091f83 in _PyRun_AnyFileObject ()
#33 0x0000557ec0084a5e in Py_RunMain ()
#34 0x0000557ec005b02d in Py_BytesMain ()
#35 0x00007f8a5a433d90 in __libc_start_call_main (main=main@entry=0x557ec005aff0, argc=argc@entry=9, argv=argv@entry=0x7fffb8baef98)
    at ../sysdeps/nptl/libc_start_call_main.h:58
#36 0x00007f8a5a433e40 in __libc_start_main_impl (main=0x557ec005aff0, argc=9, argv=0x7fffb8baef98, init=<optimized out>, 
    fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7fffb8baef88) at ../csu/libc-start.c:392
#37 0x0000557ec005af25 in _start ()
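
The native frames in this dump mostly show `_PyEval_EvalFrameDefault` and `??`, so they do not reveal which Python functions are waiting. One way to get Python-level stacks for every thread is the standard-library `faulthandler` module. This is a sketch under the assumption that a few lines can be added to the process before the server starts (for example in a wrapper script); the helper name `install_stack_dump` and the log path are hypothetical.

```python
import faulthandler
import signal


def install_stack_dump(log_path, signum=signal.SIGUSR1):
    """Dump the Python stack of every thread to log_path when signum arrives.

    Unix only. The returned file object must stay open for as long as the
    handler is registered, because faulthandler writes to its descriptor.
    """
    log_file = open(log_path, "w")
    faulthandler.register(signum, file=log_file, all_threads=True)
    return log_file
```

After calling `install_stack_dump("/tmp/stacks.log")` at startup, sending `kill -USR1 <pid>` to the hung process appends one traceback per thread to the log, with file names, line numbers, and function names.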

Thread 31 (Thread 0x7f83fe7fc640 (LWP 248689) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f83a8478890) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x7f83a8478890) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f83a8478890, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x7f83a8478890, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x7f83a8478890, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x0000557ebffa4461 in ?? ()
#7  0x0000557ebff8e8b7 in ?? ()
#8  0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#16 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#17 0x0000557ebff9ba51 in ?? ()
#18 0x0000557ec00c4f3a in ?? ()
#19 0x0000557ec00b9ef8 in ?? ()
#20 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#21 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 30 (Thread 0x7f83feffd640 (LWP 238135) "lmdeploy"):
#0  0x00007f8a58218b31 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#1  0x00007f8a582163e0 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#2  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#3  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 29 (Thread 0x7f83ff7fe640 (LWP 238134) "lmdeploy"):
#0  0x00007f8a58218b31 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#1  0x00007f8a582163e0 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#2  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#3  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 28 (Thread 0x7f83fffff640 (LWP 238133) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129ddf in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 27 (Thread 0x7f841cff9640 (LWP 238132) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129ddf in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 26 (Thread 0x7f841d7fa640 (LWP 238131) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129ddf in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 25 (Thread 0x7f841dffb640 (LWP 238130) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129ddf in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 24 (Thread 0x7f841e7fc640 (LWP 238129) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129f72 in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 23 (Thread 0x7f841effd640 (LWP 238128) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129f72 in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 22 (Thread 0x7f841f7fe640 (LWP 238127) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129f72 in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 21 (Thread 0x7f841ffff640 (LWP 238126) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129ddf in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 20 (Thread 0x7f86d6ffd640 (LWP 238125) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129f72 in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 19 (Thread 0x7f86d77fe640 (LWP 238124) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129ddf in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 18 (Thread 0x7f86d7fff640 (LWP 238123) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129ddf in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 17 (Thread 0x7f87f2ffd640 (LWP 238122) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129f72 in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 16 (Thread 0x7f87f37fe640 (LWP 238121) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x0, clockid=0, expected=0, futex_word=0x557fda93c1b0) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557fda93c1b0, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a6bdf in do_futex_wait (sem=sem@entry=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a6c78 in __new_sem_wait_slow64 (sem=0x557fda93c1b0, abstime=0x0, clockid=0) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c5d0 in PyThread_acquire_lock_timed ()
#6  0x00007f8a5a129f72 in ?? () from /usr/lib/python3.10/lib-dynload/_queue.cpython-310-x86_64-linux-gnu.so
#7  0x0000557ec003f094 in ?? ()
#8  0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff9ba51 in ?? ()
#16 0x0000557ec00c4f3a in ?? ()
#17 0x0000557ec00b9ef8 in ?? ()
#18 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#19 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 15 (Thread 0x7f89408f9640 (LWP 238107) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f896cd8e728 <cond+40>) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x7f896cd8e728 <cond+40>) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f896cd8e728 <cond+40>, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f896cd8e6c0 <mutex>, cond=0x7f896cd8e700 <cond>) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f896cd8e700 <cond>, mutex=0x7f896cd8e6c0 <mutex>) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f896cd2acc9 in uv_cond_wait (cond=<optimized out>, mutex=<optimized out>) at src/unix/thread.c:793
#6  0x00007f896cd1d659 in worker (arg=0x0) at src/threadpool.c:76
#7  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#8  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 14 (Thread 0x7f89410fa640 (LWP 238106) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f896cd8e728 <cond+40>) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x7f896cd8e728 <cond+40>) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f896cd8e728 <cond+40>, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f896cd8e6c0 <mutex>, cond=0x7f896cd8e700 <cond>) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f896cd8e700 <cond>, mutex=0x7f896cd8e6c0 <mutex>) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f896cd2acc9 in uv_cond_wait (cond=<optimized out>, mutex=<optimized out>) at src/unix/thread.c:793
#6  0x00007f896cd1d659 in worker (arg=0x0) at src/threadpool.c:76
#7  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#8  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 13 (Thread 0x7f89418fb640 (LWP 238105) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f896cd8e728 <cond+40>) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x7f896cd8e728 <cond+40>) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f896cd8e728 <cond+40>, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f896cd8e6c0 <mutex>, cond=0x7f896cd8e700 <cond>) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f896cd8e700 <cond>, mutex=0x7f896cd8e6c0 <mutex>) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f896cd2acc9 in uv_cond_wait (cond=<optimized out>, mutex=<optimized out>) at src/unix/thread.c:793
#6  0x00007f896cd1d659 in worker (arg=0x0) at src/threadpool.c:76
#7  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#8  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 12 (Thread 0x7f89430fe640 (LWP 238104) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f896cd8e72c <cond+44>) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x7f896cd8e72c <cond+44>) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f896cd8e72c <cond+44>, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f896cd8e6c0 <mutex>, cond=0x7f896cd8e700 <cond>) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f896cd8e700 <cond>, mutex=0x7f896cd8e6c0 <mutex>) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f896cd2acc9 in uv_cond_wait (cond=<optimized out>, mutex=<optimized out>) at src/unix/thread.c:793
#6  0x00007f896cd1d659 in worker (arg=0x0) at src/threadpool.c:76
#7  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#8  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 11 (Thread 0x7f89420fc640 (LWP 237976) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f8439346f14) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x7f8439346f14) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f8439346f14, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f8439346ec0, cond=0x7f8439346ee8) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f8439346ee8, mutex=0x7f8439346ec0) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f8a58080747 in std::condition_variable::wait(std::unique_lock<std::mutex>&) () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#6  0x00007f899280361e in std::condition_variable::wait<turbomind::LlamaBatch<__nv_bfloat16>::OutputThreadEntry()::{lambda()#1}>(std::unique_lock<std::mutex>&, turbomind::LlamaBatch<__nv_bfloat16>::OutputThreadEntry()::{lambda()#1}) (__p=..., __lock=..., this=0x7f8439346ee8) at /opt/rh/devtoolset-9/root/usr/include/c++/9/condition_variable:101
#7  turbomind::LlamaBatch<__nv_bfloat16>::OutputThreadEntry (this=0x7f8439346ac0) at /lmdeploy/src/turbomind/models/llama/LlamaBatch.cc:1501
#8  0x00007f8a580b0253 in ?? () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#9  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#10 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 10 (Thread 0x7f89428fd640 (LWP 237975) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x557ed528ba34) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x557ed528ba34) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x557ed528ba34, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x557ed528b9e0, cond=0x557ed528ba08) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x557ed528ba08, mutex=0x557ed528b9e0) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f8a58080747 in std::condition_variable::wait(std::unique_lock<std::mutex>&) () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#6  0x00007f89928027f0 in std::condition_variable::wait<turbomind::RequestQueue::dequeue(std::vector<std::shared_ptr<turbomind::Request>, std::allocator<std::shared_ptr<turbomind::Request> > >&, std::vector<std::shared_ptr<turbomind::Request>, std::allocator<std::shared_ptr<turbomind::Request> > >&, unsigned int, bool, bool&)::{lambda()#1}>(std::unique_lock<std::mutex>&, turbomind::RequestQueue::dequeue(std::vector<std::shared_ptr<turbomind::Request>, std::allocator<std::shared_ptr<turbomind::Request> > >&, std::vector<std::shared_ptr<turbomind::Request>, std::allocator<std::shared_ptr<turbomind::Request> > >&, unsigned int, bool, bool&)::{lambda()#1}) (__p=..., __lock=..., this=0x557ed528ba08) at /opt/rh/devtoolset-9/root/usr/include/c++/9/condition_variable:101
#7  turbomind::RequestQueue::dequeue (this=this@entry=0x557ed528b940, stop_requests=std::vector of length 0, capacity 0, infer_requests=std::vector of length 0, capacity 1, max_infer_count=max_infer_count@entry=128, blocking=blocking@entry=true, abort=@0x557ed528ba50: false) at /lmdeploy/src/turbomind/models/llama/Request.h:77
#8  0x00007f89928321e0 in turbomind::LlamaBatch<__nv_bfloat16>::InternalThreadEntry (this=<optimized out>, device_id=<optimized out>) at /lmdeploy/src/turbomind/models/llama/LlamaBatch.cc:1409
#9  0x00007f8a580b0253 in ?? () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#10 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#11 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 9 (Thread 0x7f89438ff640 (LWP 237961) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=<optimized out>, cancel=true, abstime=0x7f89438fe330, op=137, expected=0, futex_word=0x7f8938263030) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=<optimized out>, abstime=0x7f89438fe330, clockid=0, expected=0, futex_word=0x7f8938263030) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f8938263030, expected=expected@entry=0, clockid=clockid@entry=1, abstime=abstime@entry=0x7f89438fe330, private=<optimized out>) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a4a5ed0 in do_futex_wait (sem=sem@entry=0x7f8938263030, clockid=clockid@entry=1, abstime=abstime@entry=0x7f89438fe330) at ./nptl/sem_waitcommon.c:111
#4  0x00007f8a5a4a5f7e in __new_sem_wait_slow64 (sem=0x7f8938263030, clockid=1, abstime=0x7f89438fe330) at ./nptl/sem_waitcommon.c:183
#5  0x0000557ebff4c62b in PyThread_acquire_lock_timed ()
#6  0x0000557ebffa4461 in ?? ()
#7  0x0000557ebff8e8b7 in ?? ()
#8  0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#9  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#10 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#11 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#16 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#17 0x0000557ebff9ba51 in ?? ()
#18 0x0000557ec00c4f3a in ?? ()
#19 0x0000557ec00b9ef8 in ?? ()
#20 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#21 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 8 (Thread 0x7f896e4dd640 (LWP 237946) "lmdeploy"):
#0  0x00007f8a5a52fe2e in epoll_wait (epfd=38, events=0x7f89439d9090, maxevents=1, timeout=-1) at ../sysdeps/unix/sysv/linux/epoll_wait.c:30
#1  0x0000557ec00e43d2 in ?? ()
#2  0x0000557ebffae55e in ?? ()
#3  0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#4  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#5  0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#6  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#7  0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#8  0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#9  0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#10 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#11 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#12 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#13 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#14 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#15 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#16 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#17 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#18 0x0000557ebff9ba51 in ?? ()
#19 0x0000557ec00c4f3a in ?? ()
#20 0x0000557ec00b9ef8 in ?? ()
#21 0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#22 0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 7 (Thread 0x7f896feff640 (LWP 237923) "cuda-EvtHandlr"):
#0  0x00007f8a5a522bcf in __GI___poll (fds=0x7f8964000c20, nfds=11, timeout=100) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f89adc33b3f in ?? () from /usr/local/cuda/compat/lib.real/libcuda.so.1
#2  0x00007f89adcf66cf in ?? () from /usr/local/cuda/compat/lib.real/libcuda.so.1
#3  0x00007f89adc2fd2f in ?? () from /usr/local/cuda/compat/lib.real/libcuda.so.1
#4  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#5  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 6 (Thread 0x7f89808fe640 (LWP 237922) "lmdeploy"):
#0  0x00007f8a58218b31 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#1  0x00007f8a582163e0 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#2  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#3  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 5 (Thread 0x7f89810ff640 (LWP 237921) "lmdeploy"):
#0  0x00007f8a58218b31 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#1  0x00007f8a582163e0 in ?? () from /usr/local/lib/python3.10/dist-packages/torch/lib/libgomp-a34b3233.so.1
#2  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#3  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 4 (Thread 0x7f898efe2640 (LWP 237920) "cuda00001400006"):
#0  0x00007f8a5a522bcf in __GI___poll (fds=0x557ec7d95f80, nfds=3, timeout=-1) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f89adc33b3f in ?? () from /usr/local/cuda/compat/lib.real/libcuda.so.1
#2  0x00007f89adcf66cf in ?? () from /usr/local/cuda/compat/lib.real/libcuda.so.1
#3  0x00007f89adc2fd2f in ?? () from /usr/local/cuda/compat/lib.real/libcuda.so.1
#4  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#5  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 3 (Thread 0x7f89a63fe640 (LWP 237918) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f89a947de60 <thread_status+224>) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x7f89a947de60 <thread_status+224>) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f89a947de60 <thread_status+224>, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f89a947de10 <thread_status+144>, cond=0x7f89a947de38 <thread_status+184>) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f89a947de38 <thread_status+184>, mutex=0x7f89a947de10 <thread_status+144>) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f89a78036fb in blas_thread_server () from /usr/local/lib/python3.10/dist-packages/numpy/core/../../numpy.libs/libopenblas64_p-r0-0cf96a72.3.23.dev.so
#6  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#7  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 2 (Thread 0x7f89a6bff640 (LWP 237917) "lmdeploy"):
#0  __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0, futex_word=0x7f89a947dde0 <thread_status+96>) at ./nptl/futex-internal.c:57
#1  __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0, futex_word=0x7f89a947dde0 <thread_status+96>) at ./nptl/futex-internal.c:87
#2  __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0x7f89a947dde0 <thread_status+96>, expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0, private=private@entry=0) at ./nptl/futex-internal.c:139
#3  0x00007f8a5a49da41 in __pthread_cond_wait_common (abstime=0x0, clockid=0, mutex=0x7f89a947dd90 <thread_status+16>, cond=0x7f89a947ddb8 <thread_status+56>) at ./nptl/pthread_cond_wait.c:503
#4  ___pthread_cond_wait (cond=0x7f89a947ddb8 <thread_status+56>, mutex=0x7f89a947dd90 <thread_status+16>) at ./nptl/pthread_cond_wait.c:627
#5  0x00007f89a78036fb in blas_thread_server () from /usr/local/lib/python3.10/dist-packages/numpy/core/../../numpy.libs/libopenblas64_p-r0-0cf96a72.3.23.dev.so
#6  0x00007f8a5a49eac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#7  0x00007f8a5a52fa04 in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:100

Thread 1 (Thread 0x7f8a5a405740 (LWP 237915) "lmdeploy"):
#0  0x00007f8a5a52fb68 in __GI_epoll_pwait (epfd=epfd@entry=42, events=events@entry=0x7fffb8baaba0, maxevents=maxevents@entry=1024, timeout=timeout@entry=100, set=set@entry=0x0) at ../sysdeps/unix/sysv/linux/epoll_pwait.c:40
#1  0x00007f896cd2f23d in uv__io_poll (loop=0x557fda8f8ee0, timeout=100) at src/unix/linux.c:1359
#2  0x00007f896cd21ea6 in uv_run (loop=0x557fda8f8ee0, mode=UV_RUN_DEFAULT) at src/unix/core.c:447
#3  0x00007f896ccada83 in __pyx_f_6uvloop_4loop_4Loop___run (__pyx_v_self=0x557f04badca0, __pyx_v_mode=UV_RUN_DEFAULT) at uvloop/loop.c:15092
#4  0x00007f896ccb6b16 in __pyx_f_6uvloop_4loop_4Loop__run (__pyx_v_self=0x557f04badca0, __pyx_v_mode=UV_RUN_DEFAULT) at uvloop/loop.c:15471
#5  0x00007f896cc7c3c3 in __pyx_pf_6uvloop_4loop_4Loop_24run_forever (__pyx_v_self=0x557f04badca0) at uvloop/loop.c:28166
#6  __pyx_pw_6uvloop_4loop_4Loop_25run_forever (__pyx_v_self=0x557f04badca0, unused=<optimized out>) at uvloop/loop.c:27987
#7  0x00007f896cc48355 in __Pyx_PyObject_CallMethO (arg=0x0, func=0x7f899d838860) at uvloop/loop.c:171467
#8  __Pyx_PyObject_CallNoArg (func=0x7f899d838860) at uvloop/loop.c:40421
#9  0x00007f896cce128d in __pyx_pf_6uvloop_4loop_4Loop_44run_until_complete (__pyx_v_future=0x7f896ce38e10, __pyx_v_self=<optimized out>) at uvloop/loop.c:30035
#10 __pyx_pw_6uvloop_4loop_4Loop_45run_until_complete (__pyx_v_self=<optimized out>, __pyx_v_future=<optimized out>) at uvloop/loop.c:29633
#11 0x0000557ebff992ca in ?? ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7bcfa in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#16 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#17 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#18 0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#19 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#20 0x0000557ebff9c492 in PyObject_Call ()
#21 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#22 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#23 0x0000557ebff7bcfa in _PyEval_EvalFrameDefault ()
#24 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#25 0x0000557ebff7626d in _PyEval_EvalFrameDefault ()
#26 0x0000557ebff729c6 in ?? ()
#27 0x0000557ec0068256 in PyEval_EvalCode ()
#28 0x0000557ec0093108 in ?? ()
#29 0x0000557ec008c9cb in ?? ()
#30 0x0000557ec0092e55 in ?? ()
#31 0x0000557ec0092338 in _PyRun_SimpleFileObject ()
#32 0x0000557ec0091f83 in _PyRun_AnyFileObject ()
#33 0x0000557ec0084a5e in Py_RunMain ()
#34 0x0000557ec005b02d in Py_BytesMain ()
#35 0x00007f8a5a433d90 in __libc_start_call_main (main=main@entry=0x557ec005aff0, argc=argc@entry=9, argv=argv@entry=0x7fffb8baef98) at ../sysdeps/nptl/libc_start_call_main.h:58
#36 0x00007f8a5a433e40 in __libc_start_main_impl (main=0x557ec005aff0, argc=9, argv=0x7fffb8baef98, init=<optimized out>, fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7fffb8baef88) at ../csu/libc-start.c:392
#37 0x0000557ec005af25 in _start ()
Program not restarted.
Continuing.
[New Thread 0x7f87f3fff640 (LWP 271082)]
[Thread 0x7f87f3fff640 (LWP 271082) exited]

Thread 1 "lmdeploy" received signal SIGINT, Interrupt.
0x00007f8a5a52fb68 in __GI_epoll_pwait (epfd=epfd@entry=42, events=events@entry=0x7fffb8baaba0, maxevents=maxevents@entry=1024, timeout=timeout@entry=39, set=set@entry=0x0) at ../sysdeps/unix/sysv/linux/epoll_pwait.c:40
40  in ../sysdeps/unix/sysv/linux/epoll_pwait.c
#0  0x00007f8a5a52fb68 in __GI_epoll_pwait (epfd=epfd@entry=42, events=events@entry=0x7fffb8baaba0, maxevents=maxevents@entry=1024, 
    timeout=timeout@entry=39, set=set@entry=0x0) at ../sysdeps/unix/sysv/linux/epoll_pwait.c:40
#1  0x00007f896cd2f23d in uv__io_poll (loop=0x557fda8f8ee0, timeout=39) at src/unix/linux.c:1359
#2  0x00007f896cd21ea6 in uv_run (loop=0x557fda8f8ee0, mode=UV_RUN_DEFAULT) at src/unix/core.c:447
#3  0x00007f896ccada83 in __pyx_f_6uvloop_4loop_4Loop___run (__pyx_v_self=0x557f04badca0, __pyx_v_mode=UV_RUN_DEFAULT)
    at uvloop/loop.c:15092
#4  0x00007f896ccb6b16 in __pyx_f_6uvloop_4loop_4Loop__run (__pyx_v_self=0x557f04badca0, __pyx_v_mode=UV_RUN_DEFAULT)
    at uvloop/loop.c:15471
#5  0x00007f896cc7c3c3 in __pyx_pf_6uvloop_4loop_4Loop_24run_forever (__pyx_v_self=0x557f04badca0) at uvloop/loop.c:28166
#6  __pyx_pw_6uvloop_4loop_4Loop_25run_forever (__pyx_v_self=0x557f04badca0, unused=<optimized out>) at uvloop/loop.c:27987
#7  0x00007f896cc48355 in __Pyx_PyObject_CallMethO (arg=0x0, func=0x7f899d838860) at uvloop/loop.c:171467
#8  __Pyx_PyObject_CallNoArg (func=0x7f899d838860) at uvloop/loop.c:40421
#9  0x00007f896cce128d in __pyx_pf_6uvloop_4loop_4Loop_44run_until_complete (__pyx_v_future=0x7f896ce38e10, 
    __pyx_v_self=<optimized out>) at uvloop/loop.c:30035
#10 __pyx_pw_6uvloop_4loop_4Loop_45run_until_complete (__pyx_v_self=<optimized out>, __pyx_v_future=<optimized out>)
    at uvloop/loop.c:29633
#11 0x0000557ebff992ca in ?? ()
#12 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#13 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#14 0x0000557ebff7bcfa in _PyEval_EvalFrameDefault ()
#15 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#16 0x0000557ebff7645c in _PyEval_EvalFrameDefault ()
#17 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#18 0x0000557ebff7753c in _PyEval_EvalFrameDefault ()
#19 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#20 0x0000557ebff9c492 in PyObject_Call ()
#21 0x0000557ebff785d7 in _PyEval_EvalFrameDefault ()
#22 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#23 0x0000557ebff7bcfa in _PyEval_EvalFrameDefault ()
#24 0x0000557ebff8d9fc in _PyFunction_Vectorcall ()
#25 0x0000557ebff7626d in _PyEval_EvalFrameDefault ()
#26 0x0000557ebff729c6 in ?? ()
#27 0x0000557ec0068256 in PyEval_EvalCode ()
#28 0x0000557ec0093108 in ?? ()
#29 0x0000557ec008c9cb in ?? ()
#30 0x0000557ec0092e55 in ?? ()
#31 0x0000557ec0092338 in _PyRun_SimpleFileObject ()
#32 0x0000557ec0091f83 in _PyRun_AnyFileObject ()
#33 0x0000557ec0084a5e in Py_RunMain ()
#34 0x0000557ec005b02d in Py_BytesMain ()
#35 0x00007f8a5a433d90 in __libc_start_call_main (main=main@entry=0x557ec005aff0, argc=argc@entry=9, argv=argv@entry=0x7fffb8baef98)
    at ../sysdeps/nptl/libc_start_call_main.h:58
#36 0x00007f8a5a433e40 in __libc_start_main_impl (main=0x557ec005aff0, argc=9, argv=0x7fffb8baef98, init=<optimized out>, 
    fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7fffb8baef88) at ../csu/libc-start.c:392
#37 0x0000557ec005af25 in _start ()
Detaching from program: /usr/bin/python3.10, process 237915
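The Python frames in the gdb backtraces above appear only as `??` or `_PyEval_EvalFrameDefault`, so they do not show which Python function each thread is blocked in. One way to get that information from a hung process is the standard-library `faulthandler` module. The following is a minimal sketch, assuming a few lines can be added to whatever script starts the server; `install_stack_dump` is a hypothetical helper name, not part of lmdeploy, and `SIGUSR1` is only available on Unix-like systems.

```python
import faulthandler
import signal
import sys


def install_stack_dump(signum: int = signal.SIGUSR1, file=sys.stderr) -> None:
    """On `signum`, write the Python traceback of every thread to `file`.

    Hypothetical helper for illustration; call it once at start-up.
    """
    # all_threads=True dumps every Python thread, not only the one that
    # happened to receive the signal.
    faulthandler.register(signum, file=file, all_threads=True, chain=False)
```

With this installed, `kill -USR1 <pid>` on the unresponsive server would print the Python-level stacks without attaching a debugger.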

hezeli123 commented 4 months ago

No stop requests were sent during this period.

irexyc commented 4 months ago

What do the server-side logs look like with --log-level INFO?

hezeli123 commented 4 months ago

日志如下,有收到请求后的图像下载信息,后续没有LLM推理相关的日志。 2024-07-11 19:59:48,123 - lmdeploy - INFO - async_collect_pil_images latency: 98.4154 ms 2024-07-11 19:59:48,123 - lmdeploy - INFO - ImageEncoder received 1 images, left 1 images. 2024-07-11 19:59:48,123 - lmdeploy - INFO - ImageEncoder process 1 images, left 0 images. 2024-07-11 19:59:48,185 - lmdeploy - INFO - ImageEncoder forward 1 images, cost 0.061s 2024-07-11 19:59:48,185 - lmdeploy - INFO - ImageEncoder done 1 images, left 0 images. 2024-07-11 19:59:48,222 - lmdeploy - INFO - prompt='<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n<|im_start|>user\nPicture 0:\n后天迎来“大雪” 气温虽有回升但御寒保暖仍是养生重点\n\n潮新........\n\n<|im_end|>\n<|im_start|>assistant\n', gen_config=EngineGenerationConfig(n=1, max_new_tokens=1, top_p=1.0, top_k=1, temperature=1, repetition_penalty=1.0, ignore_eos=False, random_seed=15278339712589078680, stop_words=[151645], bad_words=None, min_new_tokens=None, skip_special_tokens=True, logprobs=None), prompt_token_id=[151644, 8948, 198, 2610, 525, 264, 10950, 17847, 13, 151645, 198, 151644, 872, 198, 24669, 220, 15, 25, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 198, 。。。, 198, 151644, 77091, 198], adapter_name=qwen. 
2024-07-11 19:59:48,222 - lmdeploy - INFO - session_id=129, history_tokens=0, input_tokens=538, max_new_tokens=1024, seq_start=True, seq_end=True, step=0, prep=True

wxsms commented 3 months ago

Same problem here with the InternVL2 8B model: the server stops responding after running for a while.

The error message (not sure whether it is related to this issue):

2024-08-05 09:37:02,625 - lmdeploy - INFO - ImageEncoder received 1 images, left 1 images.
2024-08-05 09:37:02,625 - lmdeploy - INFO - ImageEncoder process 1 images, left 0 images.
Exception in callback _raise_exception_on_finish(<Future finis...is WMF file')>) at /usr/local/lib/python3.10/dist-packages/lmdeploy/vl/engine.py:19
handle: <Handle _raise_exception_on_finish(<Future finis...is WMF file')>) at /usr/local/lib/python3.10/dist-packages/lmdeploy/vl/engine.py:19>
Traceback (most recent call last):
  File "/usr/lib/python3.10/asyncio/events.py", line 80, in _run
    self._context.run(self._callback, *self._args)
  File "/usr/local/lib/python3.10/dist-packages/lmdeploy/vl/engine.py", line 26, in _raise_exception_on_finish
    raise e
  File "/usr/local/lib/python3.10/dist-packages/lmdeploy/vl/engine.py", line 22, in _raise_exception_on_finish
    task.result()
  File "/usr/lib/python3.10/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/usr/local/lib/python3.10/dist-packages/lmdeploy/vl/engine.py", line 151, in forward
    outputs = self.model.forward(inputs)
  File "/usr/local/lib/python3.10/dist-packages/torch/utils/_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/lmdeploy/vl/model/internvl.py", line 171, in forward
    images = [x.convert('RGB') for x in images]
  File "/usr/local/lib/python3.10/dist-packages/lmdeploy/vl/model/internvl.py", line 171, in <listcomp>
    images = [x.convert('RGB') for x in images]
  File "/usr/local/lib/python3.10/dist-packages/PIL/Image.py", line 941, in convert
    self.load()
  File "/usr/local/lib/python3.10/dist-packages/PIL/WmfImagePlugin.py", line 161, in load
    return super().load()
  File "/usr/local/lib/python3.10/dist-packages/PIL/ImageFile.py", line 366, in load
    raise OSError(msg)

ChunyiY commented 3 months ago

Has this been resolved? I ran into a similar problem when running LLM inference with lmdeploy...

irexyc commented 3 months ago

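@wxsms Your problem is most likely an error raised during the image's convert('RGB') call, probably because the image is corrupted. lmdeploy currently has no exception handling for this internally.

Before handing images to lmdeploy, you can first check whether convert works on them.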
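The pre-check suggested above can be sketched as follows. This is a minimal illustration, assuming the image bytes are available on the client side before the request is sent; `check_image_bytes` is a hypothetical helper name and is not part of lmdeploy. It relies on Pillow's `Image.open` and `convert('RGB')`, the same call that fails in the traceback.

```python
import io

from PIL import Image


def check_image_bytes(data: bytes) -> tuple[bool, str]:
    """Return (ok, message) after fully decoding the image as RGB.

    Hypothetical helper for illustration only.
    """
    try:
        with Image.open(io.BytesIO(data)) as im:
            # convert() forces a full decode, which is the step where
            # truncated or unsupported files raise.
            im.convert('RGB')
    except Exception as exc:  # Pillow raises OSError and a few others
        return False, f'{type(exc).__name__}: {exc}'
    return True, 'ok'
```

Images for which the check returns `False` could be skipped or re-downloaded before they reach the server, so a single broken file does not raise inside the image encoder.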

irexyc commented 3 months ago

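INFO-level logs do not reflect the problem well. The best approach is to first set the environment variable with export TM_DEBUG_LEVEL=DEBUG, which automatically inserts CUDA synchronization calls. In addition, start the server with --log-level=DEBUG so that debug logs are printed.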
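The two settings above can be combined into one launch. This is a rough sketch, assuming the server is started from a Python wrapper script; `debug_launch_spec` is a hypothetical helper name, and the model name, port and session length are simply copied from the reproduction command in the original report.

```python
import os


def debug_launch_spec(model: str = 'Qwen-VL-Chat'):
    """Return (cmd, env) for starting the API server with debug logging.

    Hypothetical helper for illustration only.
    """
    env = dict(os.environ)
    # Per the comment above, this makes turbomind insert CUDA sync calls.
    env['TM_DEBUG_LEVEL'] = 'DEBUG'
    cmd = [
        'lmdeploy', 'serve', 'api_server', model,
        '--server-port', '80',
        '--session-len', '32768',
        '--log-level', 'DEBUG',
    ]
    return cmd, env


cmd, env = debug_launch_spec()
# To actually start the server (blocks until it exits):
#   import subprocess; subprocess.run(cmd, env=env, check=True)
```

Passing a copied environment keeps the variable scoped to the server process instead of changing the shell that launches it.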

coolhok commented 3 months ago

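I ran into the same problem, using lmdeploy v0.5.0 to run InternVL-v15-chat. My suspicion is that the asynchronous ImageEncoder forward running in a thread, together with the LLM forward, causes a CUDA launch to hang.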

zhyncs commented 3 months ago

try the latest release

coolhok commented 3 months ago

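I tried version 0.5.3 with TP=2 to run inference on InternVL-v15-chat. The features returned here are NaN, the final output logits are all 0, and the context has no output. With TP=1 the problem does not occur and the output is correct. [image]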

irexyc commented 3 months ago

@coolhok

With the pipeline interface, after creating the pipeline (with tp > 1), does calling the following statements directly cause any problem?

from lmdeploy.vl import load_image
im = load_image('image path')
pipe.vl_encoder.forward([im])

coolhok commented 3 months ago

@coolhok

With the pipeline interface, after creating the pipeline (with tp > 1), does calling the following statements directly cause any problem?

from lmdeploy.vl import load_image
im = load_image('image path')
pipe.vl_encoder.forward([im])

code

import asyncio

from lmdeploy import TurbomindEngineConfig, pipeline
from lmdeploy.vl import load_image

# Build the pipeline with tensor parallelism across 2 GPUs.
pipe = pipeline('/mnt/workspace/model_hub/InternVL-Chat-V1-5/',
                backend_config=TurbomindEngineConfig(
                    tp=2, cache_max_entry_count=0.5))

im = load_image('./img/1.jpg')

# Synchronous call into the vision encoder.
r_sync = pipe.vl_encoder.forward([im])
print(f"r_sync = {r_sync}")

# Asynchronous call into the same encoder. The label below is left as
# "r_sync" so that it matches the second line of the recorded output.
r_async = asyncio.run(pipe.vl_encoder.async_infer([im]))
print(f"r_sync = {r_async}")

result

[WARNING] gemm_config.in is not found; using default GEMM algo                                                                                           
[WARNING] gemm_config.in is not found; using default GEMM algo

r_sync = [tensor([[ 0.3127,  0.2219, -0.0070,  ...,  0.0645,  0.0034, -0.4839],
        [ 0.2047,  0.2479, -0.0424,  ..., -0.0375, -0.0715, -0.3203],
        [ 0.2976,  0.3047, -0.0497,  ..., -0.2800,  0.2542,  0.4961],
        ...,
        [ 0.0272,  0.1045,  0.4470,  ...,  0.2690,  0.3364, -0.7417],
        [ 0.0319,  0.0803,  0.4099,  ...,  0.2659,  0.3633, -0.7271],
        [ 0.0234,  0.0804,  0.4424,  ...,  0.3066,  0.3467, -0.7461]],
       dtype=torch.float16)]
r_sync = [tensor([[nan, nan, nan,  ..., nan, nan, nan],
        [nan, nan, nan,  ..., nan, nan, nan],
        [nan, nan, nan,  ..., nan, nan, nan],
        ...,
        [nan, nan, nan,  ..., nan, nan, nan],
        [nan, nan, nan,  ..., nan, nan, nan],
        [nan, nan, nan,  ..., nan, nan, nan]], dtype=torch.float16)]

coolhok commented 3 months ago

@irexyc Using lmdeploy v0.5.3 with TP=1, running the generated data can also make the server freeze, with HTTP 499 errors. The error stack captured at the same time is below.

_forward_loop: should more threads be added to protect these operations?

[2024-08-10 19:25:23] 2024-08-10 19:25:23,109 - lmdeploy - INFO - ImageEncoder received 1 images, left 1 images.
[2024-08-10 19:25:23] 2024-08-10 19:25:23,109 - lmdeploy - INFO - ImageEncoder process 1 images, left 0 images.
[2024-08-10 19:25:23] ERROR:asyncio:Exception in callback _raise_exception_on_finish(<Future finis... processed)')>) at /usr/local/lib/python3.10/site-packages/lmdeploy/vl/engine.py:19
[2024-08-10 19:25:23] handle: <Handle _raise_exception_on_finish(<Future finis... processed)')>) at /usr/local/lib/python3.10/site-packages/lmdeploy/vl/engine.py:19>
[2024-08-10 19:25:23] Traceback (most recent call last):
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/asyncio/events.py", line 80, in _run
[2024-08-10 19:25:23]     self._context.run(self._callback, *self._args)
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/lmdeploy/vl/engine.py", line 26, in _raise_exception_on_finish
[2024-08-10 19:25:23]     raise e
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/lmdeploy/vl/engine.py", line 22, in _raise_exception_on_finish
[2024-08-10 19:25:23]     task.result()
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/concurrent/futures/thread.py", line 58, in run
[2024-08-10 19:25:23]     result = self.fn(*self.args, **self.kwargs)
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/lmdeploy/vl/engine.py", line 151, in forward
[2024-08-10 19:25:23]     outputs = self.model.forward(inputs)
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
[2024-08-10 19:25:23]     return func(*args, **kwargs)
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/lmdeploy/vl/model/internvl.py", line 171, in forward
[2024-08-10 19:25:23]     images = [x.convert('RGB') for x in images]
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/lmdeploy/vl/model/internvl.py", line 171, in <listcomp>
[2024-08-10 19:25:23]     images = [x.convert('RGB') for x in images]
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/PIL/Image.py", line 916, in convert
[2024-08-10 19:25:23]     self.load()
[2024-08-10 19:25:23]   File "/usr/local/lib/python3.10/site-packages/PIL/ImageFile.py", line 266, in load
[2024-08-10 19:25:23]     raise OSError(msg)
[2024-08-10 19:25:23] OSError: image file is truncated (43 bytes not processed)