tue-mps-edu / asd-engd-project-2019-thermal-object-detection

This is the repository for the ASD PDEng module 2 project (2020) for AIIM
BSD 3-Clause "New" or "Revised" License

Slow Inference time using Thermal Camera #38

Closed Hrayo712 closed 4 years ago

Hrayo712 commented 4 years ago

Our current system is able to fetch images from the camera and run inference on them. However, inference time performance is below expectations: current results show an average inference time of 25 ms, which does not meet the requirements specification (10 ms).

One bottleneck that might cause this performance degradation is the simple way in which the trt_ssd.py script runs inference, that is, sequentially (i.e. no threading/parallelism).
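For reference, the sequential pattern looks roughly like the sketch below. This is only an illustration of the loop structure described above, not the actual trt_ssd.py code; run_inference is a placeholder for the TensorRT SSD call.

```python
import cv2


def run_inference(frame):
    """Placeholder for the actual TensorRT SSD inference call."""
    return []


# Each iteration blocks on the camera read before inference can start,
# so capture latency adds directly to the per-frame time.
cap = cv2.VideoCapture(0)
while cap.isOpened():
    ok, frame = cap.read()   # blocking read
    if not ok:
        break
    detections = run_inference(frame)
cap.release()
```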

Another possible source of slow inference is the mechanism used to fetch data from the camera. Currently, the code relies on OpenCV's internal mechanisms to handle the camera device interfacing (cv2.VideoCapture(0)). In other words, no specific pipeline parameters are provided to the function; it is left to OpenCV to figure out how to open the device. Some investigation has already been done on this matter, tested on an RGB USB camera (without threading), and performance degradation can be observed when the GStreamer pipeline is not properly specified.
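As a sketch of what "properly specifying the pipeline" could look like, an explicit GStreamer string can be passed to cv2.VideoCapture with the CAP_GSTREAMER backend. The exact elements, caps, and device path below are assumptions (a V4L2 device at /dev/video0) and would need to be adapted to the thermal camera.

```python
import cv2

# Explicit GStreamer pipeline instead of the bare cv2.VideoCapture(0).
# drop=true / max-buffers=1 keep only the newest frame in the appsink,
# so inference is not fed stale frames.
pipeline = (
    "v4l2src device=/dev/video0 ! "
    "video/x-raw, width=640, height=480, framerate=30/1 ! "
    "videoconvert ! video/x-raw, format=BGR ! "
    "appsink drop=true max-buffers=1"
)
cap = cv2.VideoCapture(pipeline, cv2.CAP_GSTREAMER)
```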

A third source of performance degradation is the use of Python code instead of NVIDIA's C++ API. The overhead imposed by the Python interpreter can cause some degree of slowdown, although it is still unknown how large this penalty is. That said, users have reported an inference time of ~12.69 ms (no multi-threading) in Python on the Jetson Xavier AGX (JetPack 4.2.2 + TensorRT 5). This would imply that the requirement can still be met in spite of the Python performance penalty. However, performance differences have also been observed across TensorRT versions.

Tasks:

Hrayo712 commented 4 years ago

After looking into this, I found that specifying the GStreamer pipeline, as well as threading the fetching of images from the camera device, greatly improves performance. These two approaches were therefore incorporated, allowing the system to reach an average of 100 FPS, as expected.
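A minimal sketch of how the two fixes can be combined is shown below, assuming a GStreamer pipeline string like the one above. The class and method names are illustrative only, not taken from the repository.

```python
import threading

import cv2


class ThreadedCapture:
    """Reads frames on a background thread so inference never waits on camera I/O."""

    def __init__(self, gst_pipeline):
        self.cap = cv2.VideoCapture(gst_pipeline, cv2.CAP_GSTREAMER)
        self.lock = threading.Lock()
        self.frame = None
        self.running = True
        self.thread = threading.Thread(target=self._update, daemon=True)
        self.thread.start()

    def _update(self):
        # Continuously grab the newest frame in the background.
        while self.running:
            ok, frame = self.cap.read()
            if not ok:
                continue
            with self.lock:
                self.frame = frame

    def read(self):
        # Return a copy of the most recent frame (None until the first frame arrives).
        with self.lock:
            return None if self.frame is None else self.frame.copy()

    def stop(self):
        self.running = False
        self.thread.join()
        self.cap.release()
```

With this structure, the inference loop simply calls read() and always processes the latest available frame, instead of blocking on the camera for every iteration.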