Closed djr2015 closed 3 years ago
Hello, I also encountered the problem of too long training time. What can I do to speed up the training? I noticed that torch.set_num_threads()
can be set in the training script. If I have a multi-core CPU, such as 20 cores, can I speed up training by using more CPU threads? I noticed your comment, that don't use too many CPU threads, But does this also affect CPUs with more cores?
Thank you for your work and accompanying codebase!
1) I am able to run
scripts/box_vae_chair.sh
, but I am finding it will take far longer (~1h per epoch ~= 8.3 days) to get to 200 training epochs than the 1-2 days your paper mentioned for bounding box inputs using:torch.set_num_threads(4)
but this did not affect training time (including with a number of threads >4)2) For my training (first 10 epochs) thus far KL divergence is increasing, I wondered if this was expected behavior?