NVIDIA / TorchFort

An Online Deep Learning Interface for HPC programs on NVIDIA GPUs
https://nvidia.github.io/TorchFort/

generalize cmake to build for different cuda archs #7

Closed TomMelt closed 1 year ago

TomMelt commented 1 year ago

Problem

My GPU has a CUDA compute capability of 89 (RTX 4080). Currently, CMakeLists.txt is only set up to handle compute capabilities that end in 0.

Solution

I modified the string replacement to handle generic CUDA architectures.

Fixes #8

Notes

Also, CMakeLists.txt currently builds only for compute capabilities 70 and 80 by default. You will have to add 89 to TORCHFORT_CUDA_CC_LIST on the following line if you want to build for all three (i.e., TORCHFORT_CUDA_CC_LIST "70;80;89"): https://github.com/NVIDIA/TorchFort/blob/e06613d6feccc3d11c166f146abce7abdd85f1b3/CMakeLists.txt#L5
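For reference, here is a minimal sketch of that edit (assuming TORCHFORT_CUDA_CC_LIST is defined as a CMake cache variable; the exact docstring in the repo may differ):

```cmake
# Sketch: add cc89 to the default build list in CMakeLists.txt.
set(TORCHFORT_CUDA_CC_LIST "70;80;89" CACHE STRING "CUDA compute capabilities to build for")
```

If it is a cache variable, you can also override it at configure time without editing the file, e.g. `cmake -DTORCHFORT_CUDA_CC_LIST="70;80;89" <other options>`.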

If you do not add 89 to TORCHFORT_CUDA_CC_LIST, you will get the following error when you run the binary:

```
Accelerator Fatal Error: This file was compiled: -acc=gpu -gpu=cc70 -gpu=cc80 -acc=host or -acc=multicore
Rebuild this file with -gpu=cc89 to use NVIDIA Tesla GPU 0
```
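(Side note: on recent NVIDIA drivers you can check your GPU's compute capability with `nvidia-smi --query-gpu=name,compute_cap --format=csv`; on older drivers, the deviceQuery CUDA sample reports it as well.)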
romerojosh commented 1 year ago

@azrael417 I wrote the original string replacement line to convert our TORCHFORT_CUDA_CC_LIST variable to the format PyTorch expects in TORCH_CUDA_ARCH_LIST (e.g., 70;80 to 7.0 8.0). However, as pointed out here, my replacement code only works on compute capability values that end in 0, since those are what we've been using and testing.
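For context, a sketch of what a generalized conversion can look like in CMake (illustrative only, not necessarily the exact code in the PR; variable names other than TORCHFORT_CUDA_CC_LIST and TORCH_CUDA_ARCH_LIST are made up):

```cmake
# Convert a semicolon-separated compute capability list ("70;80;89")
# into the dotted, space-separated form PyTorch expects ("7.0 8.0 8.9").
set(TORCHFORT_CUDA_CC_LIST "70;80;89")
set(_arch_list "")
foreach(cc IN LISTS TORCHFORT_CUDA_CC_LIST)
  # Insert a dot before the last digit, so 70 -> 7.0 and 89 -> 8.9.
  string(REGEX REPLACE "([0-9]+)([0-9])$" "\\1.\\2" cc_dotted "${cc}")
  list(APPEND _arch_list "${cc_dotted}")
endforeach()
list(JOIN _arch_list " " TORCH_CUDA_ARCH_LIST)
message(STATUS "TORCH_CUDA_ARCH_LIST = ${TORCH_CUDA_ARCH_LIST}") # 7.0 8.0 8.9
```

The key point is matching any trailing digit rather than a literal 0, so arbitrary compute capabilities like 89 work.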

@TomMelt Thanks for the catch! The change looks good to me. I think we will leave the default build as cc70 and cc80 for now, but users are free to set TORCHFORT_CUDA_CC_LIST as needed for their systems.