ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.
MIT License
208 stars 144 forks source link

Refactor kernel duplicate matching #1963

Closed bstefanuk closed 1 month ago

bstefanuk commented 1 month ago

Simple refactor in TensileCreateLibrary, creating a function to enclose kernel matching logic.

Testing

Unit tests included

Notable changes

Raise a DeveloperWarning instead of normal printing when duplicates are encountered.