Closed jwxsp1 closed 1 year ago
Hi @jwxsp1, as the error message mentioned, you're using the slurm trainer, which will produce an error if SLURM is not found. If your cluster does not use slurm, you should change to a different trainer such as gpu_1_host
.
Hi @jwxsp1, as the error message mentioned, you're using the slurm trainer, which will produce an error if SLURM is not found. If your cluster does not use slurm, you should change to a different trainer such as
gpu_1_host
.
Hello, thank you for your reply. I have tried to replace the trainer, but there seems to be a new problem. How can I solve this problem? I would appreciate it if you would like to answer it.
Hi, when I try to train the model, I have an error:
Do you know how to fix it?