I used Polygraphy to compare the accuracy of the ONNX FP32 model and the TensorRT FP16 engine with the following command:
and the output log showed that some nodes failed the comparison:
Comparing the images generated by the two models visually, there does seem to be a slight difference between them. Is there any way to improve the accuracy of the FP16 TensorRT model?
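For context, the kind of element-wise check I have in mind can be sketched with NumPy alone. This is only an illustration, not Polygraphy's actual implementation: the helper name `compare_outputs`, the tolerance values, and the random tensor are all assumptions, and FP16 is simulated by casting an FP32 array to `float16` and back rather than by running a real engine.

```python
import numpy as np

def compare_outputs(reference, candidate, atol=1e-5, rtol=1e-5):
    """Return (passed, max_abs_err, max_rel_err) for two same-shaped arrays.

    Hypothetical helper: passes when every element satisfies
    |ref - cand| <= atol + rtol * |ref|.
    """
    reference = np.asarray(reference, dtype=np.float32)
    candidate = np.asarray(candidate, dtype=np.float32)
    abs_err = np.abs(reference - candidate)
    # Small epsilon avoids division by zero for reference values equal to 0.
    rel_err = abs_err / (np.abs(reference) + 1e-12)
    passed = bool(np.all(abs_err <= atol + rtol * np.abs(reference)))
    return passed, float(abs_err.max()), float(rel_err.max())

# Made-up data standing in for one output tensor of the FP32 model.
rng = np.random.default_rng(0)
fp32_out = rng.standard_normal((1, 3, 8, 8)).astype(np.float32)
# Simulate FP16 rounding only; a real FP16 engine also accumulates error across layers.
fp16_out = fp32_out.astype(np.float16).astype(np.float32)

passed, max_abs, max_rel = compare_outputs(fp32_out, fp16_out)
print(f"passed={passed} max_abs_err={max_abs:.3e} max_rel_err={max_rel:.3e}")
```

With tolerances this tight, FP16 rounding alone is enough to make the check fail, so a "failed" node does not necessarily mean the visual difference comes from that node. Loosening `atol` and `rtol` shows which outputs differ by more than rounding error.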