TimoSaemann / ENet

ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
577 stars 276 forks source link

Really bad performance on TX2 #63

Closed IAmSaad closed 6 years ago

IAmSaad commented 6 years ago

The ENet paper claims to have 21fps on TX1 at the resolution 480*320 but I am getting 2fps (400ms forward pass time). Is there some kind of optimization that needs to be done.

sunmiaozju commented 6 years ago

@IAmSaad hello,could I know which jetpack you used in the TX2 ? And how you deploy the ENet on the TX2 ? We also tried to deploy the ENet on the TX2 but maybe there is something wrong. Thank you very much!

IAmSaad commented 6 years ago

We used the Jetpack 3.0 I guess. Try to find the use_cpu or use_gpu (I don't remember exactly what they were called) but the basic problem with our setup was that the these were not setup properly. We were running in the CPU mode before we found out about the switches. After that, we were getting like 10fps for 480*360 input.

softbai commented 5 years ago

We used the Jetpack 3.0 I guess. Try to find the use_cpu or use_gpu (I don't remember exactly what they were called) but the basic problem with our setup was that the these were not setup properly. We were running in the CPU mode before we found out about the switches. After that, we were getting like 10fps for 480*360 input.

@IAmSaad
Can I ask that how did you run the scripts by using pre-trained model? I test the time that the code "prediction = net.blobs['deconv6_0_0'].data[0].argmax(axis=0)" needs 2s while inference just uses 0.3s. Can you tell me the script you used? Thank you!