ardianumam / Tensorflow-TensorRT

This repository is for my YT video series about optimizing a TensorFlow deep learning model using TensorRT. We demonstrate optimizing a LeNet-like model and a YOLOv3 model, which run 3.7x and 1.5x faster, respectively, than the original models.
YOLOv3 model inference did not accelerate significantly #30

Open LeeWise9 opened 4 years ago

LeeWise9 commented 4 years ago

Hello, thank you very much for sharing. It's a great project. After converting the YOLOv3 model with TensorRT according to your steps, I find that the speedup is not noticeable: the original model and the TensorRT model both run at about 30 FPS. The platform I use is Google Colab. Strangely, on a CPU-only platform the acceleration is obvious, while on the GPU platform it is not. I'm very confused about this. What's wrong? (Has Google Colab already optimized the model computation?)
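
When comparing FPS figures like these, it can help to time only the inference call and to discard the first few runs, since the first calls typically include one-time setup costs. The following is a minimal sketch of such a timing harness, not the repository's benchmark; `measure_fps` and `infer` are hypothetical names, and `infer` stands in for whatever callable runs one forward pass of the model.

```python
import time


def measure_fps(infer, n_warmup=10, n_runs=50):
    """Return the average number of `infer` calls per second.

    `infer` is a hypothetical zero-argument callable that runs one
    forward pass. Warm-up calls are executed but not timed.
    """
    for _ in range(n_warmup):
        infer()  # untimed: absorbs one-time setup costs

    start = time.perf_counter()
    for _ in range(n_runs):
        infer()
    elapsed = time.perf_counter() - start

    # Guard against a zero interval on very fast callables.
    return n_runs / elapsed if elapsed > 0 else float("inf")


# Stand-in workload; replace with the real inference call.
fps = measure_fps(lambda: sum(range(1000)))
print(f"{fps:.1f} calls per second")
```

If both models still report the same FPS when measured this way, the bottleneck may lie outside the converted graph (for example in pre-processing or post-processing), though that is only one possible explanation.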

han1366 commented 4 years ago

me too

MuhammadAsadJaved commented 3 years ago

+1. The speed and model size are the same. However, the number of nodes is slightly reduced. (Screenshot from 2020-09-28 12-14-27)
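
One way to look into a node count that barely changes is to tally the nodes by op type before and after conversion. In TF-TRT, subgraphs handed to TensorRT appear as `TRTEngineOp` nodes, so a small number of them alongside many remaining native ops would suggest that most of the graph still runs in TensorFlow. The sketch below is illustrative only: `count_ops` is a hypothetical helper, and the `SimpleNamespace` objects are stand-ins for the entries of a real `graph_def.node` list, which likewise expose an `.op` attribute.

```python
from collections import Counter
from types import SimpleNamespace


def count_ops(nodes):
    """Tally nodes by op type; each node only needs an `.op` attribute."""
    return Counter(node.op for node in nodes)


# Stand-in nodes for illustration; with TensorFlow installed, pass
# `graph_def.node` from the frozen or converted graph instead.
nodes = [
    SimpleNamespace(op="Conv2D"),
    SimpleNamespace(op="TRTEngineOp"),
    SimpleNamespace(op="Conv2D"),
]

counts = count_ops(nodes)
print("TRTEngineOp nodes:", counts["TRTEngineOp"])
print("Most common ops:", counts.most_common(5))
```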