DeiC-HPC / cotainr

cotainr - a user space Apptainer/Singularity container builder.
European Union Public License 1.2

Update to LUMI PyTorch Example 09/2023 #41

Closed: Chroxvi closed 9 months ago

Chroxvi commented 10 months ago

Updates the LUMI PyTorch examples to the most recent stable versions of Python, ROCm, and PyTorch as of 09/2023. Also updates the LUMI SLURM scripts, since the eap partition is no longer available on LUMI.

I have tested these examples on LUMI using `cotainr build lumi_pytorch_rocm_demo.sif --base-image docker://rocm/dev-ubuntu-22.04:5.6.1-complete --conda-env py311_rocm542_pytorch.yml`. The base image is specified explicitly because the `--system=lumi-g` option still provides the "rocm-terminal" image, which does not include all the ROCm pieces needed for the PyTorch wheels.
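For readability, the build command quoted above can be split into named parts. The following is a minimal sketch, not part of the PR itself: the variable names and the `RUN_BUILD` switch are hypothetical, while the image name, base image, and conda environment file are copied from the command in this description. By default the script only prints the command; the build runs only if `cotainr` is on the `PATH` and `RUN_BUILD=1` is set.

```shell
#!/usr/bin/env bash
# Values copied verbatim from the cotainr command in the PR description.
image="lumi_pytorch_rocm_demo.sif"
base_image="docker://rocm/dev-ubuntu-22.04:5.6.1-complete"
conda_env="py311_rocm542_pytorch.yml"

# Assemble the command as an array so every argument stays a single word.
build_cmd=(cotainr build "$image" --base-image "$base_image" --conda-env "$conda_env")

# Always print the command for inspection.
printf '%s ' "${build_cmd[@]}"
echo

# RUN_BUILD is a hypothetical opt-in switch for this sketch:
# the build only runs when cotainr is installed and RUN_BUILD=1.
if command -v cotainr >/dev/null 2>&1 && [ "${RUN_BUILD:-0}" = "1" ]; then
    "${build_cmd[@]}"
fi
```

Keeping the base image in its own variable makes it easy to switch back to `--system=lumi-g` once that option provides an image with the full ROCm stack.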

A few notes about the performance of the examples: