ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.
MIT License

Remove globals from prepAsm #1962

Closed by bstefanuk 1 month ago

bstefanuk commented 1 month ago

Removes globals from prepAsm in TensileCreateLibrary and adds type hints to writeKernels.
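
For readers unfamiliar with this kind of refactor, here is a rough sketch of the general pattern: settings that a function used to read from module-level globals are passed in explicitly, and signatures carry type hints. This is not the code from this PR. Every name below (`AsmToolchain`, `build_asm_command`, `write_kernel_commands`) and the example flag are hypothetical stand-ins chosen for illustration, and the real prepAsm and writeKernels may look quite different.

```python
from pathlib import Path
from typing import List, NamedTuple


class AsmToolchain(NamedTuple):
    """Hypothetical bundle of settings that would otherwise live in module globals."""
    assembler: str
    arch_flags: List[str]


def build_asm_command(toolchain: AsmToolchain, source: Path, output: Path) -> List[str]:
    """Return an assembler command line built only from the arguments passed in."""
    return [toolchain.assembler, *toolchain.arch_flags, "-c", str(source), "-o", str(output)]


def write_kernel_commands(
    toolchain: AsmToolchain, sources: List[Path], out_dir: Path
) -> List[List[str]]:
    """Build one command per kernel source file (illustrative, not Tensile's API)."""
    return [
        build_asm_command(toolchain, src, out_dir / (src.stem + ".o"))
        for src in sources
    ]


# Usage: the caller constructs the settings once and threads them through.
toolchain = AsmToolchain(assembler="clang", arch_flags=["-mcpu=gfx90a"])
commands = write_kernel_commands(toolchain, [Path("k0.s")], Path("build"))
```

Because nothing is read from global state, each function can be tested in isolation by passing a small `AsmToolchain` value, and a type checker can flag callers that pass the wrong kind of argument.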