ardianumam / Tensorflow-TensorRT

This repository accompanies my YouTube video series on optimizing a TensorFlow deep learning model using TensorRT. We demonstrate optimizing a LeNet-like model and a YOLOv3 model, which run 3.7x and 1.5x faster, respectively, than the original models.

No Speedup Observed #31

Open swchew opened 4 years ago

swchew commented 4 years ago

Hi, thank you for this excellent tutorial! I was unable to observe any speedup when running the code in "7_optimizing_YOLOv3_using_TensorRT". Both with and without TensorRT, I measured only 1.5 fps.
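To compare the two runs fairly, it may help to time both graphs the same way and exclude the first few iterations, which often include one-time setup cost. The following is a minimal sketch, not code from the repository: `measure_fps` and its `run_inference` argument are hypothetical names, with `run_inference` standing in for whatever performs a single forward pass (for example, one `sess.run(...)` call).

```python
import time


def measure_fps(run_inference, n_warmup=5, n_runs=50):
    """Return the average frames per second of run_inference().

    run_inference is a hypothetical zero-argument callable that performs
    one forward pass. Warm-up iterations are run but not timed.
    """
    # Warm-up: the first calls may include initialization overhead.
    for _ in range(n_warmup):
        run_inference()

    # Timed section: only these n_runs calls count toward the result.
    start = time.perf_counter()
    for _ in range(n_runs):
        run_inference()
    elapsed = time.perf_counter() - start

    # Guard against a zero interval for extremely fast callables.
    return n_runs / max(elapsed, 1e-9)
```

Calling `measure_fps` once with the original graph and once with the TensorRT-optimized graph, on the same input, gives two numbers that can be compared directly.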

There were 2 warnings in the console:

  1. I tensorflow/core/grappler/devices.cc:51] Number of eligible GPUs (core count >= 8): 0
  2. I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
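
The first message may indicate that TensorFlow is not using a GPU at all, in which case both runs would execute on the CPU and show the same speed. It can also appear on devices whose GPU is present but has fewer than 8 multiprocessors. A quick way to see which of the two applies is to list the devices TensorFlow can see. The sketch below assumes the installed TensorFlow build exposes `tensorflow.python.client.device_lib`; `visible_gpus` is a hypothetical helper name, not part of the tutorial code.

```python
def visible_gpus():
    """Return names of GPU devices visible to TensorFlow.

    Returns None if TensorFlow cannot be imported in this environment.
    """
    try:
        # Assumption: this module is available in the installed build.
        from tensorflow.python.client import device_lib
    except ImportError:
        return None
    return [d.name for d in device_lib.list_local_devices()
            if d.device_type == "GPU"]


gpus = visible_gpus()
if gpus is None:
    print("TensorFlow is not installed in this environment")
elif not gpus:
    print("No GPU visible to TensorFlow; inference would run on the CPU")
else:
    print("GPUs visible to TensorFlow:", gpus)
```

If the list is empty, a CPU-only TensorFlow package or a CUDA/driver mismatch would be worth ruling out before looking at TensorRT itself.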

Any advice is appreciated!