Open hangst opened 8 years ago
I'd like that a lot. Unfortunately, I don't have much time. Please feel free to re-implement my code with other QPU assemblers if you want.
2016年5月29日(日) 17:19 thomashang notifications@github.com:
I've tried the python example codes on my RPi2 and the multithreaded sgemm computation time is really amazing.
I wonder if there is sgemm implementation in C such that it can be used in other applications coded in C that requires matrix computation heavily (e.g. convolution in computer vision)
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/nineties/py-videocore/issues/3, or mute the thread https://github.com/notifications/unsubscribe/AAKiGieFYXcX071p_qWWN5CcQv5qiq8iks5qGUwPgaJpZM4IpPX9 .
It's here: https://github.com/Terminus-IMRC/qmkl
I've tried the python example codes on my RPi2 and the multithreaded sgemm computation time is really amazing.
I wonder if there is sgemm implementation in C such that it can be used in other applications coded in C that requires matrix computation heavily (e.g. convolution in computer vision)