Open · InkChan opened 2 years ago
Learn-CUDA-Programming/tree/master/Chapter02/02_memory_overview/01_sgemm/sgemm.cu, sgemm_gpu_kernel: sum += A[i + row * K] * B[col + i * M]; I think it should be M.
Yeah, you are right. And the annotation in this file should say "matrix B" instead of "matrix C": //! @param K width of matrix A and height of matrix B
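For anyone checking the indexing by hand, here is a minimal host-side sketch of the same inner loop, not the repository's code. It assumes row-major storage with A as N x K, B as K x M, and C as N x M. Using N for the row count is my assumption, and the name sgemm_reference is hypothetical. The outer two loops stand in for the per-thread row and col that the kernel would derive from block and thread indices.

```cpp
// Hypothetical CPU reference mirroring the kernel's indexing (a sketch, not the repo's code).
// Assumed row-major layout: A is N x K, B is K x M, C is N x M.
void sgemm_reference(const float *A, const float *B, float *C,
                     int N, int M, int K, float alpha, float beta)
{
    for (int row = 0; row < N; ++row) {
        for (int col = 0; col < M; ++col) {
            float sum = 0.f;
            for (int i = 0; i < K; ++i) {
                // A's row stride is K (the width of A);
                // B's row stride is M (the width of B), not K.
                sum += A[i + row * K] * B[col + i * M];
            }
            // C has M columns, so its row stride is also M.
            C[col + row * M] = alpha * sum + beta * C[col + row * M];
        }
    }
}
```

A wrong stride only shows up with non-square inputs (K different from M), so comparing the GPU result against a reference like this on such a case is a quick way to catch it.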