Closed: yazdanbakhsh closed this issue 5 months ago
For now, I removed the host comparison so I can run the CUDA part. Only the static example works, though. Why is that the case?
./matmul_example -> ERROR
** On entry to cusparseLtMatmulDescriptorInit(): matrix type/compute type combination is not supported, current: IN=CUDA_R_16BF, OUT=CUDA_R_16BF, COMPUTE=COMPUTE_TF32
CUSPARSE API failed at line 206 with error: operation not supported (10)
./matmul_example_static -> WORKED (still with the host comparison part removed)
@yazdanbakhsh could you please share the command line that you used?
@yazdanbakhsh I was able to reproduce the compile error and we will make the fix soon. Thanks for catching it.
The fix is simply to change line 308 to:
hC_result[posC] = static_cast<C_t>(alpha * sum + beta * static_cast<float>(hC[posC])); // [i][j]
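For context, here is a minimal sketch of what a host reference check built around that line might look like. It is not the sample's actual code: the function name host_gemm_reference, the row-major layout, and the std::vector containers are assumptions made for illustration. The point it shows is that the accumulation happens in float and every conversion to and from the storage type is an explicit static_cast, which is what avoids ambiguous mixed-type arithmetic when C_t is a reduced-precision type.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical host reference (not the sample's real function):
// computes alpha * A * B + beta * C for row-major A (m x k), B (k x n), C (m x n).
template <typename AB_t, typename C_t>
std::vector<C_t> host_gemm_reference(const std::vector<AB_t>& hA,
                                     const std::vector<AB_t>& hB,
                                     const std::vector<C_t>&  hC,
                                     std::size_t m, std::size_t n, std::size_t k,
                                     float alpha, float beta) {
    std::vector<C_t> hC_result(m * n);
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            // Accumulate in float regardless of the storage type.
            float sum = 0.0f;
            for (std::size_t k1 = 0; k1 < k; ++k1) {
                sum += static_cast<float>(hA[i * k + k1]) *
                       static_cast<float>(hB[k1 * n + j]);
            }
            const std::size_t posC = i * n + j;
            // Explicit casts on both sides, as in the proposed fix.
            hC_result[posC] = static_cast<C_t>(
                alpha * sum + beta * static_cast<float>(hC[posC])); // [i][j]
        }
    }
    return hC_result;
}
```

With float for both template parameters this compiles as plain C++, so the reference logic can be checked on its own before switching C_t to a half-precision type.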
Regarding ./matmul_example -> ERROR, could you set the environment variable CUSPARSELT_LOG_LEVEL=5 and rerun it?
Cuda compilation tools, release 12.4, V12.4.131
20.04.1-Ubuntu
cuSPARSELt 0.6.1
NVIDIA A100-SXM4-40GB
Driver Version: 550.54.15
I am trying to compile the code in matmul, but keep getting the following error: