Open kasivisu82 opened 5 years ago
I've ran initially SRRESNET with VGG54 for 10 power 5 iterations.
How do the results of this training look? If I recall correctly, in the paper they train SRResNet with MSE for 10^6 iterations as the initial seed (even for SRGAN VGG54) (10^5 iterations should be fine too). I've had problems with bad local minima when using VGG54 loss without the GAN, so maybe this could be related to your problem.
I haven't encountered these artifacts with SRGAN, but with a similar architecture I experienced the exact same artifacts from iteration to iteration. I suspect that it could just be due to the instability of GANs. You can pick just results from an iteration without artifacts if you are primarily concerned about visual quality. If you only care about the metrics then pick the iteration with the best metrics - the metrics don't correspond with visual quality that well anyways.
Hi Trevor,
Thank you for your clarifications.
Based on our discussion on your feedback, we've decided to try a few things, which are given below.
Kindly let me know your comments on all the 3 steps.
Sorry for the delay.
This sequence should reproduce the paper's result:
python train.py --name srresnet-mse --content-loss mse --train-dir path/to/dataset
# wait for srresnet-mse to train for 10^6 iterations
python train.py --name srgan-vgg54 --use-gan --content-loss vgg54 --train-dir path/to/dataset --load results/srresnet-mse/weights-1000000
# wait for srgan to train for 10^5 iterations
python train.py --name srgan-vgg54 --use-gan --content-loss vgg54 --train-dir path/to/dataset --load results/srgan-vgg54/weights-100000 --learning-rate 1e-5
# wait for srgan to train for 10^5 iterations
Hi,
i've used your implementation for SRGAN. The steps are below. i've used 800 images from div_2k as training dataset and 90 images from div_2k as test images. I've ran initially SRRESNET with VGG54 for 10 power 5 iterations. Then used the obtained weights to initialize (--load) for SRGAN with VGG54 and ran for another 10 power 5 iterations.
The PSNR and SSIM are given below for Set5, Set14, BSD100 as well as Div_2K's 90 images (all average values): [BSD100] PSNR: 25.18, SSIM: 0.6398 [Set14] PSNR: 26.25, SSIM: 0.6966 [Set5] PSNR: 29.33, SSIM: 0.8370 [div2k-90] 26.48, SSIM: 0.6984
But some of the images (from all the 4 sets given above) had artifacts. (Please see the attached images). The artifacts are present in some iterations and not present in some iterations. But the iterations with low training loss also has artifacts.
Can you help me with the following questions?
Expecting your reply, as i'm struck in this a bit.