Nsight Compute fails when profiling kernel performance

spcl / open-earth-compiler

development repository for the open earth compiler


I compiled my example with the following pass pipeline and build steps:

    oec-opt --stencil-shape-inference --convert-stencil-to-std --cse --parallel-loop-tiling='parallel-loop-tile-sizes=128,1,1' --canonicalize --test-gpu-greedy-parallel-loop-mapping --convert-parallel-loops-to-gpu --canonicalize --lower-affine --convert-scf-to-std --stencil-kernel-to-cubin ../test/Examples/test.mlir > temp.mlir
    mlir-translate --mlir-to-llvmir temp.mlir > temp.bc
    llc -O3 temp.bc -o temp.s
    clang -c temp.s -o temp.o
    nvcc --default-stream per-thread -allow-unsupported-compiler -ccbin clang main.cc temp.o -lcuda-runtime-wrappers -lcudart -lcuda

The main.cc and test.mlir files are in the attached zip. Is any step in my pipeline wrong? I want to use ncu to profile the kernel in more detail, but Nsight Compute fails when I try to profile it.

Thank you for your help!

test.zip


Nsight Compute fails when profiling kernel performance #50