hughperkins / tf-coriander

OpenCL 1.2 implementation for Tensorflow
Apache License 2.0
789 stars 94 forks source link

Execution speed is much slower than CPU in MacBook Pro? #71

Open elife33 opened 6 years ago

elife33 commented 6 years ago

In my MacBook Pro(Mid 2015), Execution speed in AMD Radeon R9 M370X Compute Engine is much slower than CPU(2.5 GHz Intel Core i7). Is this normal?

elife33 commented 6 years ago

GPU:

real 0m42.142s user 0m31.954s sys 0m18.096s

CPU: QiangdeMacBook-Pro:3_NeuralNetworks elife$ time python multilayer_perceptron.py Extracting /tmp/data/train-images-idx3-ubyte.gz Extracting /tmp/data/train-labels-idx1-ubyte.gz Extracting /tmp/data/t10k-images-idx3-ubyte.gz Extracting /tmp/data/t10k-labels-idx1-ubyte.gz 2017-10-16 15:53:15.945534: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations. 2017-10-16 15:53:15.945553: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations. 2017-10-16 15:53:15.945568: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX2 instructions, but these are available on your machine and could speed up CPU computations. 2017-10-16 15:53:15.945572: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use FMA instructions, but these are available on your machine and could speed up CPU computations. Epoch: 0001 cost=333.116085219 Epoch: 0002 cost=97.091073028 Epoch: 0003 cost=71.633393175 Epoch: 0004 cost=58.102855900 Epoch: 0005 cost=48.550296390 Epoch: 0006 cost=43.879424709 Epoch: 0007 cost=38.416489090 Epoch: 0008 cost=35.230118899 Epoch: 0009 cost=32.181227765 Epoch: 0010 cost=30.920996809 Epoch: 0011 cost=28.272015703 Epoch: 0012 cost=26.596986358 Epoch: 0013 cost=25.029801500 Epoch: 0014 cost=23.558648788 Epoch: 0015 cost=22.414703977 Optimization Finished! Accuracy: 0.8971

real 0m32.758s user 2m15.099s sys 0m10.327s

elife33 commented 6 years ago

Bitcoin mining Data courtesy CompuBench Radeon R9 M370X Mac 111.14 mHash/s

CompuBench 1.5 (Bitcoin mining) Data courtesy CompuBench Core i7 4870HQ 30.62 mHash/s

elife33 commented 6 years ago

Face detection Data courtesy CompuBench Radeon R9 M370X Mac 25.65 mPixels/s

CompuBench 1.5 (Face detection) Core i7 4870HQ 18.08 mPixels/s

hughperkins commented 6 years ago

It is. Since all convolutions are running on the CPU currently. I never quite got around to attaching https://github.com/hughperkins/coriander-dnn to tf-coriander, so convolutions are on the cpu currently. If someone has a moment, would be good to get that working. Happy to provide guidance, meet in Google Hangouts etc, if anyone wants to take a look at that.

datatalking commented 1 year ago

@hughperkins, I'm open to try as i'm working to get a 560 running on high sierra via gpu