L1 and L2 losses produce blurry results on image reconstruction problems. [Autoencoding beyond pixels using a learned similarity metric.]
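A toy illustration (not from the paper) of why pixel-wise regression blurs: when several ground-truth outputs are plausible for the same input, the L2-optimal prediction is the mean of the modes (a blur), while the L1-optimal prediction is the median. The pixel values below are made up for the demo.

```python
import numpy as np

# Three equally likely "ground truth" pixel values for the same input
# (a multimodal target, e.g. an edge that could land on either pixel).
targets = np.array([0.0, 0.0, 1.0])

candidates = np.linspace(0, 1, 101)
l2 = [np.mean((targets - c) ** 2) for c in candidates]
l1 = [np.mean(np.abs(targets - c)) for c in candidates]

best_l2 = candidates[int(np.argmin(l2))]  # the mean ~1/3: an averaged, "blurry" value
best_l1 = candidates[int(np.argmin(l1))]  # the median 0.0: one of the sharp modes
print(best_l2, best_l1)
```

The adversarial loss in the paper sidesteps this averaging by penalizing outputs that a discriminator can tell apart from real images, rather than penalizing distance to a single target.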
At inference time, they run the generator net in exactly the same manner as during the training phase. This differs from the usual protocol in that they apply dropout at test time, and they apply batch normalization using the statistics of the test batch, rather than aggregated statistics of the training batch.
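A minimal numpy sketch (not the authors' code) of the batch-norm detail above: normalizing with the current test batch's statistics versus with stored training statistics. With test-batch statistics the output is exactly zero-mean, unit-variance per feature; with a batch size of 1 this reduces to what is often called instance normalization.

```python
import numpy as np

def batchnorm(x, gamma=1.0, beta=0.0, running=None, eps=1e-5):
    """Normalize features x of shape (batch, features).

    running=None   -> use the statistics of the current (test) batch,
                      as pix2pix does at inference.
    running=(m, v) -> use aggregated training statistics (the usual protocol).
    """
    if running is None:
        mean, var = x.mean(axis=0), x.var(axis=0)
    else:
        mean, var = running
    return gamma * (x - mean) / np.sqrt(var + eps) + beta

rng = np.random.default_rng(0)
x = rng.normal(loc=5.0, scale=2.0, size=(8, 4))  # a "test batch"

y = batchnorm(x)                 # test-batch statistics: centered output
z = batchnorm(x, running=(0.0, 1.0))  # stale training stats: offset preserved
print(y.mean(axis=0), z.mean(axis=0))
```

Using test-batch statistics keeps the generator stochastic in the same way it was during training, which is why the paper's outputs vary with the batch contents.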
[paper] && [code]
Authors:
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros
Berkeley AI Research (BAIR) Laboratory, UC Berkeley
Highlight
This paper treats image-to-image translation as a problem in which the input and output differ in surface appearance but are both renderings of the same underlying structure. The results suggest that conditional adversarial networks are a promising approach for many image-to-image translation tasks, especially those involving highly structured graphical outputs.
They use a "U-Net"-based architecture as the generator and for the discriminator they use a convolutional "PatchGAN" classifier, which only penalizes structure at the scale of patches. The discriminator tries to classify if each NxN patch in an image is real or fake.
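The patch size N of a PatchGAN is just the receptive field of the discriminator's final conv unit, so it can be computed from kernel sizes and strides. A sketch, assuming the layer configuration of the 70x70 PatchGAN commonly used with pix2pix (three stride-2 and two stride-1 conv layers, all with 4x4 kernels):

```python
# (kernel, stride) per conv layer; this configuration is an assumption
# matching the commonly used 70x70 PatchGAN, not taken from this note.
layers = [(4, 2), (4, 2), (4, 2), (4, 1), (4, 1)]

# Standard receptive-field recurrence: each layer widens the field by
# (kernel - 1) input pixels per unit of the current effective stride.
rf, jump = 1, 1
for k, s in layers:
    rf += (k - 1) * jump
    jump *= s

print(rf)  # -> 70: each discriminator output classifies one 70x70 patch
```

Because each output unit only sees an NxN patch, the discriminator models high-frequency local texture, while the L1 term in the full objective handles low-frequency correctness.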
The experiments cover a variety of tasks and datasets, including: