NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
create_execution_context() doesn't release the Python GIL, see the pybind code here. Would it be possible to add a py::call_guard<py::gil_scoped_release>{} there? On a Jetson TX2, for my model, it takes about 7.5 seconds to create the execution context. This makes my semi-realtime program unresponsive during this period, and it would be nicer if the Python interpreter kept functioning.
Description
create_execution_context() doesn't release the Python GIL, see the pybind code here. Would it be possible to add a
py::call_guard<py::gil_scoped_release>{}
there? On a Jetson TX2, for my model, it takes about 7.5 seconds to create the execution context. This makes my semi-realtime program unresponsive during this period, and it would be nicer if the Python interpreter kept functioning.