daadaada / turingas

Assembler for NVIDIA Volta and Turing GPUs
MIT License
196 stars 41 forks source link

TuringAs

An open source SASS assembler for NVIDIA Volta, Turing, and Ampere GPUs.

Requirements:

Supported hardware:

All NVIDIA Volta (SM70), Turing (SM75), and Ampere (SM80) GPUs.

Other features:

Install the library

python setup.py install

Use the library

python -m turingas.main -i <input.sass> -o <output.cubin> -arch <arch>

# E.g.
python -m turingas.main -i input.sass -o output.cubin -arch 75 # 75 for Turing
                                                               # or 70 (Volta), 80 (Ampere)

Related projects

AsFermi, an SASS assembler for NVIDIA Fermi GPUs. By Hou Yunqing.

MaxAs, an SASS assembler for NVIDIA Maxwell and Pascal. By Scott Gray.

KeplerAs, an SASS assembler for NVIDIA Kepler. By Xiuxia Zhang.

TODO list

To support following instructions:

Citation

If you find this tool helpful, please cite:

@inproceedings{yan2020winograd-conv,
  author    = {Da Yan and
               Wei Wang and
               Xiaowen Chu},
  title     = {Optimizing Batched Winograd Convolution on GPUs},
  booktitle = {25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '20)},
  year      = {2020},
  address   = {San Diego, CA, USA},
  publisher = {ACM},
}

This project is released under the MIT License.

-- Da Yan