ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.
MIT License
212 stars 145 forks source link