tlc-pack / relax

Apache License 2.0
193 stars, 58 forks

[CUTLASS] Update cutlass generator to add fp32 SIMT kernels #451

Closed vinx13 closed 1 year ago

vinx13 commented 1 year ago

This PR syncs the CUTLASS generator with upstream and enables FP32 SIMT kernels for SM75. (On SM80, FP32 inputs use TF32 tensor core kernels by default.)
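
For readers unfamiliar with the distinction, here is a minimal sketch of the dispatch idea described above. The helper `select_fp32_gemm_opclass` and its return strings are hypothetical names chosen for illustration; they are not part of the generator's actual API. It assumes only what the description states: SM80 maps FP32 to TF32 tensor core kernels by default, while SM75 has no FP32 tensor core mode and so needs SIMT (CUDA core) kernels.

```python
# Hypothetical helper for illustration only; not the generator's real code.
def select_fp32_gemm_opclass(sm: int) -> str:
    """Return which kind of kernel an fp32 GEMM would map to for a given SM."""
    if sm >= 80:
        # Assumption: SM80 and newer default to TF32 tensor core kernels.
        return "tensorop_tf32"
    # SM75 tensor cores have no fp32/TF32 mode, so fall back to SIMT kernels.
    return "simt"


for sm in (75, 80):
    print(f"sm_{sm}: {select_fp32_gemm_opclass(sm)}")
```

The practical difference is numerical as well as performance-related: TF32 rounds the FP32 mantissa before multiplication, whereas SIMT kernels compute in full FP32.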

cc @masahi @yelite