ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.
MIT License
208 stars 143 forks source link

Help enabling wmma instructions #1943

Open spayne opened 2 months ago

spayne commented 2 months ago

I used the the example at the bottom of https://github.com/ROCm/Tensile/wiki#quick-example-ubuntu to generate a set of solutions.

However, I noticed that none of the generated solutions are using the WMMA instructions. My architecture does have HasWMMA=1.

I confirmed that kernel["EnableMatrixInstructions'] is False in the kernel writer code... so that's not good - the problem is probably up the stack from there.

I thought I could add EnableMatrixInstructions to the rocblas_sgemm_example.yaml but I get an error Tensile::FATAL: Invalid parameter name: EnableMatrixInstruction

So:

Thanks!