DDMAL / hpc-trainer-component

MIT License
0 stars 0 forks source link

Request Specific GPU Nodes for > 125 GB of Memory #2

Closed JRegimbal closed 4 years ago

JRegimbal commented 4 years ago

See https://docs.computecanada.ca/wiki/Using_GPUs_with_Slurm. Essentially if we request more memory than 125 GB and don't specify a specific GPU model, it will attempt to assign us to a P100 node without enough memory and the job will not be scheduled.