coreweave / ml-containers

MIT License
19 stars 3 forks source link

build(torch): Build with CUDA 12.2.2 and NCCL v2.18.5 #46

Closed Eta0 closed 10 months ago

Eta0 commented 10 months ago

torch Images with CUDA 12.2.2 & NCCL v2.18.5

This change adds torch images built with CUDA 12.2.2, and updates all CUDA 12 torch:nccl images to NCCL v2.18.5 using the base images from coreweave/nccl-tests#26.

We were previously unable to build the full suite of torch images from CUDA 12.2.0 due to its lack of cuDNN support, but CUDA 12.2.2 has cuDNN support again, so this is our first update supporting CUDA 12.2.