branch :https://github.com/triton-inference-server/fastertransformer_backend.git main
docker version : nvcr.io/nvidia/tritonserver:21.07-py3
GPU: Tesla V100
when i flow the steps to build fastertransformer backend, an error happend:
451 BackendModel(TRITONBACKEND_Model* triton_model);
/workspace/build/_deps/repo-backend-src/include/triton/backend/backend_model.h:45:3: note: candidate expects 1 argument, 2 provided
[100%] Linking cxX executable../../../../../bin/multi_gpu_gpt_triton_example
[100%] Built target multi_gpu_gpt_triton_example
make[2]:***[CMakeFiles/triton-fastertransformer-backend.dir/build.make:82:CMakeFiles/triton-fastertransformer-backend.dir/src/libfastertransforner.cc.o] Error 1 make[1]:***[CMakeFiles/Makefile2:1457:CMakeFiles/triton-fastertransformer-backend.dir/all] Error 2 make[1]: *** Waiting for unfinished jobs....
/workspace/build/_deps/repo-ft-src/src/fastertransformer/utils/nccl_utils.h:In function'bool test_context_sharing(const strings, const strings) [with T = float]'
/workspace/build/_deps/repo-ft-src/src/fastertransformer/utils/ncclutils.h:72:144:warning:'pipeline_para.fastertransformer::NcclParam:inccl_uid’may be used uninitiali ed in this function [-Wmaybe-uninitialized]
721 NcclParam(NcclParam const& param)
/workspace/bui1d/_deps/repo-ft-src/src/fastertransformer/utils/nccutils.h:72:144:warning:'tensor_para.fastertransformer::NcclParam::nccl_uid_’ may be used uninitialize in this function [-Wmaybe-uninitialized]
721 NcclParam(NcclParam const& param):
A
/
[100%] Linking CXX executable ../../../../bin/test_context_decoder_layer
[100%] Built target test_context_decoder_layer
[100%] Linking CXX executable ../../..../bin/test_sampling
[100%] Built target test_sampling
make: *** [Makefile:149: all] Error 2
Description
Reproduced Steps