Open bo-wang-up opened 3 months ago
Hi, I encountered an issue where autoencoder training cannot be implemented on multiple GPUs. The training always pauses with a 'Missing logger folder' error, as shown in the image below.
Hi, sorry for the delayed response. Could you please specify
Thanks.
Thanks for your reply. I have solved the problem. I changed the backend from 'nccl' to 'gloo'
Hi, I encountered an issue where autoencoder training cannot be implemented on multiple GPUs. The training always pauses with a 'Missing logger folder' error, as shown in the image below.