Open amnash opened 5 years ago
What would be the best way to run multi-node training on cloud compute instances? Similar to multi-node DGX1/DGX2 training using slurm?
What would be the best way to run multi-node training on cloud compute instances? Similar to multi-node DGX1/DGX2 training using slurm?