Open A-ML-ER opened 1 year ago
I0404 14:43:41.957637 63955 server.cc:594] +-------------------+---------+-----------------------------------------------------------------------------------------------------+ | Model | Version | Status | +-------------------+---------+-----------------------------------------------------------------------------------------------------+ | fastertransformer | 1 | UNAVAILABLE: Not found: unable to load shared library: /opt/tritonserver/backends/fastertransformer | | | | /libtriton_fastertransformer.so: undefined symbol: _ZN22ParallelGptTritonModelI6__halfE8toStringB5c
Do you change any code?
I had the same problem , yes, I change code, just add support a new model. This symbol is not found in libtriton_fastertransformer.so:
nm -D libtriton_fastertransformer.so | grep ParallelGptTritonModel | grep toString
But it's found in libtransformer-shared.so, I see the same without modifying the code, but no error is reported.
how to fix it?
Do you add the new model in https://github.com/NVIDIA/FasterTransformer/blob/main/CMakeLists.txt#L317?
@byshiue yes, I had add the new model in transformer-shared. and I had add some code in src/fastertransformer/triton_backend, such as tritonmodel and tritonmodelinstance. and I also add my new model in fastertransfomer_backend repo src/libfastertransformer.cc file. The code now feels fine.
@A-ML-ER Have you solved the problem?
@byshiue I have same problem https://github.com/triton-inference-server/fastertransformer_backend#rebuilding-fastertransformer-backend-optional
cmake \
-D CMAKE_EXPORT_COMPILE_COMMANDS=1 \
-D CMAKE_BUILD_TYPE=Release \
-D ENABLE_FP8=OFF \
-D BUILD_MULTI_GPU=ON \
-D BUILD_PYT=ON \
-D SM=80 \
-D CMAKE_INSTALL_PREFIX=/opt/tritonserver \
-D TRITON_COMMON_REPO_TAG="r${NVIDIA_TRITON_SERVER_VERSION}" \
-D TRITON_CORE_REPO_TAG="r${NVIDIA_TRITON_SERVER_VERSION}" \
-D TRITON_BACKEND_REPO_TAG="r${NVIDIA_TRITON_SERVER_VERSION}" \
..
I need to use BUILD_PYT=ON
.
But, same error occured.
UNAVAILABLE: Not found: unable to load shared library: /opt/tritonserver/backends/fastertransformer/libtriton_fastertransformer.so: undefined symbol: _ZN22ParallelGptTritonModelI6__halfE8toStringB5cxx1
Description
Reproduced Steps