Add sm_90a arch gencode to docker

xuzhao9 commented 3 months ago

We need to override the upstream config and add sm_90a gencode to the docker.

Test plan: https://github.com/pytorch/benchmark/actions/runs/9704484783

https://github.com/pytorch/benchmark/actions/runs/9704484783/job/26784787989#step:5:2057

 #11 900.6 + /usr/local/cuda-12.4/bin/nvprune -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a /usr/local/cuda-12.4/lib64/libcublas_static.a -o /usr/local/cuda-12.4/lib64/libcublas_static.a
#11 901.5 + /usr/local/cuda-12.4/bin/nvprune -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a /usr/local/cuda-12.4/lib64/libcublasLt_static.a -o /usr/local/cuda-12.4/lib64/libcublasLt_static.a

Nightly docker build: https://github.com/pytorch/benchmark/actions/runs/9704719476

facebook-github-bot commented 3 months ago

@xuzhao9 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot commented 3 months ago

@xuzhao9 merged this pull request in pytorch/benchmark@740e15a57e604747ecef91bc803f92bea04dcc16.

pytorch / benchmark

Add sm_90a arch gencode to docker #2338