hughperkins / tf-coriander

OpenCL 1.2 implementation for Tensorflow
Apache License 2.0
792 stars 90 forks source link

speed slower than only using cpu #77

Open subshall opened 6 years ago

subshall commented 6 years ago

my gpu is gt940m cuda vision is 8.0 cudnn visions 5.1 and i run the examples,it works and the execution times is similar as you. but for the recurrent networks, Coriander is around 20 times slower than using CUDA .not four and even slower than using cpu. is this normal ?