Hi,
When I run the ABACUS example, everything goes well except that the training is canceled by Slurm when it reaches the time limit.
Our cluster limits how long a single job submission can use the GPU accelerator cards. Is it possible to resume the training from where it stopped in the next submission?