Closed by ppwwyyxx 8 years ago
Hi Yuxin,
I tried your C benchmark and it works great, thank you very much! I think you are perfectly right: it makes much more sense to compare our kernel with cuBLAS. You just earned a place in our article's acknowledgements :)
Hi, your work is great! I ran some benchmarks in C++, and they show that the xnor kernel is 23x faster than your baseline gemm and 3.3x faster than NVIDIA's cuBLAS gemm on a GTX 980.
Also, I think comparing with Theano doesn't make a lot of sense, because it looks like everything gets slowed down in Theano. cuBLAS would be a good baseline to compare with, since it is the fastest available gemm implementation.
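For readers who have not seen the trick behind an xnor gemm, here is a minimal CPU-side sketch of the inner product it is built on. This is not the kernel from the repository or the benchmark mentioned above; the function names `pack_signs`, `xnor_dot` and `reference_dot` are made up for illustration, and the encoding (bit 1 for +1, bit 0 for -1) is an assumption. The idea is that for vectors of +1/-1 values, the dot product equals the number of matching positions minus the number of mismatching ones, which can be computed with one bitwise operation and one popcount per 64 elements.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: pack +1/-1 values into 64-bit words.
// Assumed encoding: bit 1 means +1, bit 0 means -1. Padding bits stay 0.
std::vector<std::uint64_t> pack_signs(const std::vector<int>& v) {
    std::vector<std::uint64_t> words((v.size() + 63) / 64, 0);
    for (std::size_t i = 0; i < v.size(); ++i) {
        if (v[i] > 0) {
            words[i / 64] |= (std::uint64_t{1} << (i % 64));
        }
    }
    return words;
}

// Dot product of two packed +1/-1 vectors with n logical elements.
// popcount(xnor) counts matches; here we count mismatches with xor instead,
// which is equivalent (matches = n - mismatches) and ignores the zero padding.
int xnor_dot(const std::vector<std::uint64_t>& a,
             const std::vector<std::uint64_t>& b,
             std::size_t n) {
    assert(a.size() == b.size());
    std::size_t mismatches = 0;
    for (std::size_t w = 0; w < a.size(); ++w) {
        mismatches += std::bitset<64>(a[w] ^ b[w]).count();
    }
    // matches - mismatches = (n - mismatches) - mismatches
    return static_cast<int>(n) - 2 * static_cast<int>(mismatches);
}

// Plain integer dot product, used only to check the packed version.
int reference_dot(const std::vector<int>& a, const std::vector<int>& b) {
    assert(a.size() == b.size());
    int sum = 0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        sum += a[i] * b[i];
    }
    return sum;
}
```

Calling `xnor_dot(pack_signs(a), pack_signs(b), a.size())` should give the same value as `reference_dot(a, b)` for any two equal-length +1/-1 vectors. A GPU kernel would apply the same identity per row/column pair, typically with a hardware popcount instruction, but the tiling and memory layout that make it fast are outside the scope of this sketch.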