ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.
MIT License

Refactor logic file discovery #1931

Status: Closed (bstefanuk closed 2 months ago)

bstefanuk commented 2 months ago

Objectives:

Outcomes:

Testing:

Unit testing:

Command to run unit tests (from Tensile/Tensile/Tests):

$ pytest unit/test_TensileCreateLibrary.py
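As a rough illustration of the kind of behavior a unit test for logic file discovery might cover, here is a minimal sketch. It is not Tensile's implementation: the helper name `find_logic_files`, the assumption that logic files are `.yaml` files under a directory tree, and the test layout are all hypothetical and chosen only for the example.

```python
import tempfile
from pathlib import Path
from typing import List


def find_logic_files(root: Path, extension: str = ".yaml") -> List[Path]:
    """Hypothetical helper: recursively collect files ending in `extension`.

    Assumption: logic files are identified purely by file extension.
    """
    if not root.is_dir():
        raise FileNotFoundError(f"Logic path is not a directory: {root}")
    # Sort so the result is deterministic regardless of filesystem order.
    return sorted(p for p in root.rglob(f"*{extension}") if p.is_file())


def test_find_logic_files() -> None:
    """pytest-style test: build a throwaway tree and check what is discovered."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "sub").mkdir()
        (root / "a.yaml").write_text("")
        (root / "sub" / "b.yaml").write_text("")
        (root / "notes.txt").write_text("")  # wrong extension, must be skipped

        found = [p.relative_to(root).as_posix() for p in find_logic_files(root)]
        assert found == ["a.yaml", "sub/b.yaml"]
```

Saved in a file matching pytest's `test_*.py` naming convention, a test like this would be collected by the same `pytest` invocation style shown above. Building the directory tree inside a temporary directory keeps the test independent of any real logic files on disk.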

Integration testing:

Notable changes:

Docker environment info:

$ cat /etc/os-release | head -1
PRETTY_NAME="Ubuntu 22.04.3 LTS"
$ apt show rocm-libs -a | head -2 | tail +2
Version: 6.1.2.60102-119~22.04
$ uname -r
5.15.0-86-generic