modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
https://www.funasr.com

Client request to the Triton server fails #704

Closed standyyyy closed 1 year ago

standyyyy commented 1 year ago

The client request to Triton fails; below is a brief summary of the configuration.

OS: linux
Python/C++ Version: client environment is Python 3.9
Package Version: pytorch, torchaudio, modelscope, funasr (pip list)
Model: infer_pipeline
Command:
python3 client/decode_manifest_triton.py \
    --server-addr $serveraddr \
    --compute-cer \
    --model-name infer_pipeline \
    --num-tasks $num_task \
    --manifest-filename $manifest_path
Details: none; I followed the documentation at https://github.com/alibaba-damo-academy/FunASR/tree/main/funasr/runtime/triton_gpu
Error log:
tritonclient.utils.InferenceServerException: [StatusCode.INTERNAL] in ensemble 'infer_pipeline', Failed to process the request(s) for model instance 'scoring_0_0', message: Failed to open the cudaIpcHandle. error: invalid resource handle

On the server side, I set everything up following the commands in the documentation, and the ports are mapped out of the container. (screenshot omitted)

My local machine runs Windows, and Triton is started via Docker. Below is the local CUDA environment. (screenshots omitted)

LauraGPT commented 1 year ago

@yuekaizhang Please help to solve this issue.

yuekaizhang commented 1 year ago

Failed to open the cudaIpcHandle. error: invalid resource handle

tritonserver --model-repository /workspace/model_repo_paraformer_large_offline \
    --pinned-memory-pool-byte-size=512000000

Please remove the memory pool size option and try again.

https://github.com/triton-inference-server/server/issues/5798
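For reference, the suggestion above amounts to launching the server with its default memory-pool settings; a sketch of the command, reusing the model repository path from the report above:

```shell
# Launch Triton without an explicit pinned-memory pool size,
# letting the server fall back to its defaults.
tritonserver --model-repository /workspace/model_repo_paraformer_large_offline
```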

standyyyy commented 1 year ago

I tried that, but it still fails with the same error as above.

standyyyy commented 1 year ago

I wonder if it is because the request is initiated from Windows; the link below says CUDA shared memory is not supported on Windows:
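A minimal sketch of how a client could guard against this at runtime: Triton's CUDA shared-memory transport relies on the cudaIpc* APIs, which are Linux-only, so a Windows client should fall back to sending tensors over plain gRPC. The function and variable names here are illustrative, not part of the FunASR client.

```python
import platform

def cuda_shm_supported() -> bool:
    """Return True only on Linux, where Triton's CUDA shared-memory
    transport (based on cudaIpcOpenMemHandle) is available."""
    return platform.system() == "Linux"

# Hypothetical usage: pick the tensor-transfer mode before building requests.
transfer_mode = "cuda_shm" if cuda_shm_supported() else "grpc"
```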

standyyyy commented 1 year ago

Tested it: with an enterprise-grade GPU, the request goes through.