mitmul / ssai-cnn

Semantic Segmentation for Aerial / Satellite Images with Convolutional Neural Networks including an unofficial implementation of Volodymyr Mnih's methods
http://www.ingentaconnect.com/content/ist/jist/2016/00000060/00000001/art00003
MIT License
260 stars 75 forks source link

Tesla P100 is slower #17

Closed Tejuwi closed 6 years ago

Tejuwi commented 6 years ago

Hi,

I have tested an algorithm on Tesla p100 (Ubuntu Server16.04 LTS x86_64) and it takes one epoch 2Hrs. I applied same algorithm on Quadro M4000 (Ubuntu desktop 16.04 LTS x86_64) takes 2Hrs 40min.

We expected that training time will comedown to half of the Quadro M4000 but their no much difference between Tesla p100 and Quadro M4000.

Please give me guidance so that the training time will be reduced. I appreciate you kind help

Tejuwi commented 6 years ago

I think it depends on size of image. GPU performance is well now and i think it depends on load on it