Idein / py-videocore

Python library for GPGPU on Raspberry Pi
MIT License
795 stars 89 forks

Getting correct values in matrix multiplication #4

Closed wywfalcon closed 8 years ago

wywfalcon commented 8 years ago

Hello there, I came across this code and wanted to use your matrix multiplication implementation (sgemm.py). The result I get from numpy's reference calculation, A.dot(B) + C, differs from the output of the program. For example, I used 64 x 64 matrices for A, B and C, each filled with ones. The numpy reference calculation gives a matrix filled with 65s, but in the program's output matrix C only one row is filled with 65s. Does this mean I am reading the output from the wrong matrix, or is something else going wrong? Thanks.

Alpha and beta are both kept at 1.
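For anyone reproducing this, the expected values can be checked on the CPU with a short NumPy sketch. The helper names `reference_sgemm` and `mismatched_rows` are made up for illustration and are not part of py-videocore. The `gpu_like` array is a hand-built stand-in for a result with only one correct row, not actual output from sgemm.py; the issue does not say which row was correct or what the other rows contained.

```python
import numpy as np


def reference_sgemm(A, B, C, alpha=1.0, beta=1.0):
    """CPU reference: alpha * A.dot(B) + beta * C (hypothetical helper)."""
    return alpha * A.dot(B) + beta * C


def mismatched_rows(result, reference, rtol=1e-5):
    """Return the indices of rows where result differs from reference."""
    row_ok = np.isclose(result, reference, rtol=rtol).all(axis=1)
    return np.flatnonzero(~row_ok)


n = 64
A = np.ones((n, n), dtype=np.float32)
B = np.ones((n, n), dtype=np.float32)
C = np.ones((n, n), dtype=np.float32)

# Each entry of A.dot(B) sums 64 ones, and adding C gives 65 everywhere.
R = reference_sgemm(A, B, C)

# Hand-built stand-in (an assumption, not real GPU output):
# start from C and make only row 0 match the reference.
gpu_like = C.copy()
gpu_like[0, :] = 65.0

bad = mismatched_rows(gpu_like, R)
print("reference is all 65s:", bool(np.all(R == 65.0)))
print("number of mismatched rows:", bad.size)
```

Comparing row by row like this shows whether the mismatch is confined to particular rows, which helps separate "reading the wrong buffer" from "only part of the output was computed".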

wywfalcon commented 8 years ago

Okay, it only works on Pi 0. This information was hidden somewhere in the Japanese tutorial, and when tested on that board it works.