Open xpillons opened 1 year ago
The cause is that Slurm restricts the number of GPUs based on the --gpus job option, so the number of GPUs visible inside a job can be smaller than the number of GPU devices on the node. The vglrun alias command line uses the number of NVIDIA devices to set the number of GPUs, which is wrong under Slurm.
When running glxspheres64 with vglrun on a shared node, a segmentation fault occurs, while running it without vglrun works.
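
To illustrate the mismatch described above, here is a minimal sketch of counting the GPUs a job can actually see instead of counting every NVIDIA device on the node. It assumes Slurm exports `CUDA_VISIBLE_DEVICES` for the job (typical when GPUs are configured as GRES, but this depends on the cluster setup) and that device nodes follow the `/dev/nvidia<N>` naming. The function name `visible_gpu_count` and its parameters are hypothetical and are not part of the existing alias.

```python
import glob
import os
import re


def visible_gpu_count(environ=None, dev_glob="/dev/nvidia[0-9]*"):
    """Return the number of GPUs usable in the current context.

    Hypothetical helper: prefers the job-scoped CUDA_VISIBLE_DEVICES list
    (assumed to be set by Slurm for GPU jobs) and only falls back to
    counting device nodes when that variable is absent.
    """
    env = os.environ if environ is None else environ
    visible = env.get("CUDA_VISIBLE_DEVICES")
    if visible is not None:
        # Comma-separated list of GPU ids or UUIDs; empty means no GPU.
        return len([d for d in visible.split(",") if d.strip()])
    # Fallback outside a job: count /dev/nvidia0, /dev/nvidia1, ...
    # (the regex excludes nodes such as /dev/nvidiactl).
    return len([p for p in glob.glob(dev_glob)
                if re.fullmatch(r".*/nvidia\d+", p)])


if __name__ == "__main__":
    print(visible_gpu_count())
```

On a shared node with several GPUs where the job requested a single one, this would report the job-scoped count rather than the node-wide device count, which is the value the vglrun alias would need.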