NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

[QST] How to compile and run `examples/35_gemm_softmax` ? #1728

Closed hxdtest closed 1 week ago

hxdtest commented 3 weeks ago

What is your question?

When I try to run `examples/35_gemm_softmax`, I compile the file with:

`nvcc --expt-relaxed-constexpr -I /mnt5/xuantai.hxd/cutlass/include -I /mnt5/xuantai.hxd/run_cutlass/cutlass_gemm_softmax -I /mnt5/xuantai.hxd/cutlass/tools/util/include gemm_softmax.cu -o run -std=c++17`

The binary builds, but running it prints:

`void cutlass::arch::ldsm(cutlass::Array<unsigned int, MatrixCount, true> &, const void *) [with Layout = cutlass::layout::RowMajor; int MatrixCount = 4] not implemented`

`Device synchronize failed with error unspecified launch failure`

Is there anything wrong with the compile command?

thakkarV commented 3 weeks ago

Your command-line flags are wrong. Please use the CUTLASS CMake build system or replicate its flags.
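For readers who hit the same error: the command in the question passes no `-arch` flag, so nvcc compiles for its default architecture. The `ldsm ... not implemented` message is most likely the fallback path CUTLASS takes when device code is built for a target older than sm_75. A minimal sketch of a standalone command with an explicit architecture follows. The checkout path `./cutlass` and the `sm_80` target are assumptions, not details from the thread, so adjust both to your setup. The script only prints the command unless `RUN=1` is set.

```shell
#!/bin/sh
# Sketch only: rebuild the nvcc command from the question with an explicit
# GPU architecture. CUTLASS_DIR and SM are assumptions, not from the thread.
set -eu

CUTLASS_DIR="${CUTLASS_DIR:-./cutlass}"   # hypothetical checkout location
SM="${SM:-80}"                            # assumed compute capability (Ampere)
EXAMPLE_DIR="$CUTLASS_DIR/examples/35_gemm_softmax"

# Same flags as the original command, plus -arch=sm_XX.
# Paths are assumed to contain no spaces, since the string is word-split below.
NVCC_CMD="nvcc -std=c++17 --expt-relaxed-constexpr -arch=sm_$SM \
-I $CUTLASS_DIR/include \
-I $CUTLASS_DIR/tools/util/include \
-I $EXAMPLE_DIR \
$EXAMPLE_DIR/gemm_softmax.cu -o run"

if [ "${RUN:-0}" = "1" ]; then
  $NVCC_CMD            # actually invoke nvcc
else
  echo "$NVCC_CMD"     # dry run: just show the command
fi
```

The route recommended in the reply is the CMake build, which sets these flags for you. Configuring with `-DCUTLASS_NVCC_ARCHS=80` and then building the `35_gemm_softmax` target should be equivalent, again assuming an sm_80 GPU.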