ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.
MIT License
213 stars 145 forks source link

Add kernel helper write fxns #1983

Closed bstefanuk closed 4 weeks ago

bstefanuk commented 1 month ago

This is a part of a larger refactor of writeKernels in TensileCreateLibrary.

Notable changes

Testing