Update for Pytorch 1.7 - Githubissues

LeviDPC commented 4 years ago

This project has a bug that has kept it from running on versions of Pytorch 1.5 or after. I'm pretty sure this is because in Pytorch 1.5 one of the bug fixes was: torch.optim optimizers changed to fix in-place checks for the changes made by the optimizer. If you want to know more find the patch notes and do a search for those key words.

A different pull request had a fix that ran but ended up with bad picture output.

The supplementary materials for SinGAN list one of the optimizations as alternating between 3 gradient steps for the generator and 3 gradient steps for the discriminator. This is different from the basic Pytorch DCGAN Tutorial (which SinGANs code seems to be partially based on) where this isn't the case and it simply alternates between generator and discriminator steps. It appears that the bug is coming from the OptimizerG step operation performing inplace operations on the Generator parameters. By removing this optimization to make it more like the tutorial I've fixed the bug.

I've done some initial testing and the model seems to work just fine. For performance, my fix seems to go slower initially but then catch up and surpass the current version. If anyone would like to test my changes, they are pushed to my fork of SinGAN: https://github.com/Clefspear99/SinGAN

-Clef

tamarott commented 3 years ago

Thank you Clef! Would be happy to hear if all tests worked fine. Should I push this to the main brunch?

LeviDPC commented 3 years ago

Hey tamarott,

Yeah, I'm glad to have helped and glad my suggestions didn't break anything further!

If you feel confident then by all means go ahead and push them.

Best, Clef

Thierryonre commented 3 years ago

Would push this so that people don't have to go through the issues to find out how to fix the bug - especially new people.

tamarott commented 3 years ago

This code seems to do a single D step per G step (in groups of three steps), and is therefore not identical to the original implementation (were 3 D steps are followed by 3 G steps). Identical performance is not granted.

alicehua11 commented 3 years ago

I didn't see this sooner. Spent sometimes trying to fix the problem, would be good to have a fix on this pull request for future people.

markstrefford commented 2 years ago

Thanks for this @Clefspear99 @tamarott one of the issues with the original code is the version of Torch doesn't work on more recent GPUs, such as the RTX30xx series. I appreciate that changing the approach for steps is different here, but at least this works pull request works with more recent hardware and CUDA/CUDNN versions from NVidia.

markstrefford commented 2 years ago

For those that want a version that works on more recent Pytorch versions (with the caveats above around not guaranteeing the exact same results), you can find it here https://github.com/Clefspear99/SinGAN (thanks for @Clefspear99 )

tamarott / SinGAN

Update for Pytorch 1.7 #126