ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.
MIT License
208 stars 142 forks source link

Replace YAML operations with C libyaml backend #1937

Closed bstefanuk closed 1 month ago

bstefanuk commented 2 months ago

Objectives

Testing:

TensileCreateLibrary

When timing the execution of TensileCreateLibrary on gfx90a architectures, the following differences were noted:

Command issued:

$ time Tensile/bin/TensileCreateLibrary ${rocblas_root}/library/src/blas3/Tensile/Logic/asm_full/aldebaran build_profiler HIP --merge-files --separate-architecture --lazy-library-loading --no-short-file-names --no-library-print-debug --code-object-version=default --cxx-compiler=hipcc --jobs=64 --library-format=msgpack --architecture=gfx90a

rocBLAS

When building rocBLAS, the following differences were noted:

Command issued:

$ ./install.sh
bstefanuk commented 2 months ago

Just double check if this requires a new dependency that would need to get approved and documents.

@AlexBrownAMD I have tested a simple yaml load on Windows and confirmed that the necessary dependencies are installed by default on Ubuntu 22.04/20.04 and Rocky Linux (see links)

https://releases.ubuntu.com/jammy/ubuntu-22.04.4-desktop-amd64.manifest https://releases.ubuntu.com/focal/ubuntu-20.04.6-desktop-amd64.manifest https://download.rockylinux.org/pub/rocky/9/BaseOS/x86_64/os/Packages/l/