Hello there, I came across this code and wanted to use your matrix multiplication implementation (sgemm.py). It seems like the result I get with (numpy's reference calculation) A.dot(B) + C differ from the output of the program. For example, I used 64 x 64 matrixes for A, B and C each filled with ones. While I get the result of a matrix filled with 65s in the numpy's reference calculation, I get only one row filled with 65s in the output of the program in the matrix C. Does this mean I am reading from the wrong matrix for the output or something else is acting strange? Thanks.
Hello there, I came across this code and wanted to use your matrix multiplication implementation (sgemm.py). It seems like the result I get with (numpy's reference calculation) A.dot(B) + C differ from the output of the program. For example, I used 64 x 64 matrixes for A, B and C each filled with ones. While I get the result of a matrix filled with 65s in the numpy's reference calculation, I get only one row filled with 65s in the output of the program in the matrix C. Does this mean I am reading from the wrong matrix for the output or something else is acting strange? Thanks.
Alpha and beta is kept at the value 1