Closed: lix19937 closed this issue 1 week ago
After checking several times, I believe this is a CUDA bug that shows up when the I/O bus is busy. Removing the printf calls from the kernels gives the correct result.
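For anyone hitting the same thing, one way to keep debug output removable is to put the device-side printf behind a compile-time flag and compare the output buffers of both builds bit for bit. This is only a minimal sketch, not the plugin code from this issue: `scale_kernel`, `run_scale`, `same_bits`, and the `kDebug` flag are hypothetical names, the kernel is a trivial stand-in, and the input is assumed to be non-empty.

```cuda
#include <cstdio>
#include <cstring>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical stand-in kernel: multiplies each element by `factor`.
// kDebug is a compile-time flag, so the printf call is compiled out
// entirely when the kernel is instantiated with kDebug = false.
template <bool kDebug>
__global__ void scale_kernel(const float* in, float* out, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    float v = in[i] * factor;
    if (kDebug && i == 0) {
        // Device-side printf; its output is flushed at the next synchronization.
        printf("scale_kernel: out[0] = %f\n", v);
    }
    out[i] = v;
}

// Runs the kernel once on a non-empty input and copies the result back.
// Returns an empty vector if any CUDA call fails.
template <bool kDebug>
std::vector<float> run_scale(const std::vector<float>& host_in, float factor) {
    const int n = static_cast<int>(host_in.size());
    const size_t bytes = host_in.size() * sizeof(float);
    std::vector<float> host_out(host_in.size(), 0.0f);
    float* d_in = nullptr;
    float* d_out = nullptr;

    bool ok = cudaMalloc(&d_in, bytes) == cudaSuccess &&
              cudaMalloc(&d_out, bytes) == cudaSuccess &&
              cudaMemcpy(d_in, host_in.data(), bytes,
                         cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        const int threads = 256;
        const int blocks = (n + threads - 1) / threads;
        scale_kernel<kDebug><<<blocks, threads>>>(d_in, d_out, n, factor);
        // Check both the launch and the execution before reading results.
        ok = cudaGetLastError() == cudaSuccess &&
             cudaDeviceSynchronize() == cudaSuccess &&
             cudaMemcpy(host_out.data(), d_out, bytes,
                        cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(d_in);   // cudaFree(nullptr) is a no-op, so this is safe on failure
    cudaFree(d_out);
    if (!ok) host_out.clear();
    return host_out;
}

// Bitwise comparison of two result buffers (stricter than a tolerance check).
bool same_bits(const std::vector<float>& a, const std::vector<float>& b) {
    if (a.size() != b.size()) return false;
    if (a.empty()) return true;
    return std::memcmp(a.data(), b.data(), a.size() * sizeof(float)) == 0;
}
```

Calling `run_scale<false>(in, 2.0f)` and `run_scale<true>(in, 2.0f)` on the same input and passing both results to `same_bits` shows whether the printf build diverges. If it does, it may also be worth running the kernel under compute-sanitizer, since a device-side printf changes timing and code generation and could expose or hide a race or an uninitialized read.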
Description
In a TensorRT plugin, adding a printf call inside a CUDA kernel changes the computed result compared with the same kernel without the printf call. Very strange!
Environment
TensorRT Version: trt8510
NVIDIA GPU: RTX 3070
NVIDIA Driver Version:
CUDA Version: 11.4
CUDNN Version: 11.6
Operating System: Ubuntu 20.04