csjliang / LPTN

Official implementation of the paper 'High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network' in CVPR 2021
Apache License 2.0
411 stars 46 forks source link

LS-GAN & Multi-scale discriminator #8

Closed echoofluoc closed 3 years ago

echoofluoc commented 3 years ago

In 3.4 Learning criteria, the paper mentions the adversarial loss is computed based on the LS-GAN and a multi-scale discriminator ,but when I look into the train_FiveK_paper.yml , it seems you didn't use those criterias?

As suggested in provided config yml, standard GAN and single discriminator are used to formulate adversarial learning , does the latter setting work better(in PSNR metric or better visual quality)?

csjliang commented 3 years ago

Hi. Thanks for your question. In our recent experiment, we found that the WGAN criteria (single discriminator with gradient penalty) lead to better performance both quantitatively and qualitatively, so we recommend this setting for follow-up works. One possible attempt in the future might be discriminating on each frequency band and construct a 'multi-frequency discriminator'.