Closed xuzhao9 closed 3 months ago
We need to override the upstream config and add sm_90a gencode to the docker.
sm_90a
Test plan: https://github.com/pytorch/benchmark/actions/runs/9704484783
https://github.com/pytorch/benchmark/actions/runs/9704484783/job/26784787989#step:5:2057
#11 900.6 + /usr/local/cuda-12.4/bin/nvprune -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a /usr/local/cuda-12.4/lib64/libcublas_static.a -o /usr/local/cuda-12.4/lib64/libcublas_static.a #11 901.5 + /usr/local/cuda-12.4/bin/nvprune -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a /usr/local/cuda-12.4/lib64/libcublasLt_static.a -o /usr/local/cuda-12.4/lib64/libcublasLt_static.a
Nightly docker build: https://github.com/pytorch/benchmark/actions/runs/9704719476
@xuzhao9 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@xuzhao9 merged this pull request in pytorch/benchmark@740e15a57e604747ecef91bc803f92bea04dcc16.
We need to override the upstream config and add
sm_90a
gencode to the docker.Test plan: https://github.com/pytorch/benchmark/actions/runs/9704484783
https://github.com/pytorch/benchmark/actions/runs/9704484783/job/26784787989#step:5:2057
Nightly docker build: https://github.com/pytorch/benchmark/actions/runs/9704719476