Open leiwen83 opened 3 years ago
Do you mind sharing the binary so that we can use it for debugging? Thanks.
Hi,
It can also be reproduced with onnx2trt. The ONNX file is available at: https://media.githubusercontent.com/media/onnx/models/master/vision/classification/resnet/model/resnet50-v2-7.onnx
Then run it with a tool such as instr_count.so preloaded:
LD_PRELOAD=./instr_count.so onnx2trt -b 1 -d 16 -w 20000000000 resnet50-v2-7.onnx -o 1.trt
I am not able to reproduce it with my local binary.
I do not see trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_small_nhwc_linkable_tn_v1, but only trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_small_nhwc_tn_v1, trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_medium_nhwc_tn_v1 and trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_large_nhwc_tn_v1.
trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_* kernels.
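To compare which kernels of this family each setup actually launches, a small helper like the following might be useful. This is only a sketch: `list_trt_kernels` and `run.log` are names made up here, and it assumes the tool's output was captured to a file in which kernel names appear verbatim.

```shell
#!/bin/sh
# Print the distinct kernel names of the trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_*
# family found in a captured log file.
# Assumption: kernel names appear verbatim somewhere in each relevant log line;
# nothing else about the log format is relied on.
list_trt_kernels() {
    grep -oE 'trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_[A-Za-z0-9_]+' "$1" | sort -u
}

# Hypothetical usage (run.log is a placeholder for the captured output):
#   LD_PRELOAD=./instr_count.so onnx2trt -b 1 -d 16 -w 20000000000 resnet50-v2-7.onnx -o 1.trt > run.log 2>&1
#   list_trt_kernels run.log
```

Running this on logs from both machines and diffing the results would show whether the `linkable` variant is launched in only one of the environments.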
Hi,
When I use NVBit to trace a program containing a TensorRT kernel, it reports an illegal memory access with the sample plugins such as instr_count or instr_count_bb.
The kernel name is trt_ampere_h1688cudnn_128x128_ldg8_relu_exp_small_nhwc_linkable_tn_v1, the TensorRT version is 7.2.1, and NVBit is the latest version.
Thx, Lei