NorbertZheng / read-papers

My paper reading notes.

Sik-Ho Tsang | Review: DCGAN -- Deep Convolutional Generative Adversarial Network (GAN). #58


NorbertZheng commented 1 year ago

Sik-Ho Tsang. Review: DCGAN — Deep Convolutional Generative Adversarial Network (GAN).

NorbertZheng commented 1 year ago

Overview

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks.

This note reviews the Deep Convolutional Generative Adversarial Network (DCGAN), by Indico Research and Facebook AI Research (FAIR). With DCGAN, both the generator and the discriminator learn a hierarchy of representations, from object parts to scenes. The paper was published at ICLR 2016 and has about 6000 citations.

NorbertZheng commented 1 year ago

A Set of Constraints for Stable Training

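All-convolutional net: deterministic spatial pooling functions (such as max pooling) are replaced with strided convolutions in the discriminator and fractionally-strided (transposed) convolutions in the generator. Note that these are not dilated convolutions: a strided convolution changes the spatial resolution, so the network learns its own downsampling and upsampling.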

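Eliminating Fully Connected Layers: no fully connected hidden layers on top of the convolutional features.

Batch Normalization (BN): used in both the generator and the discriminator, except at the generator output layer and the discriminator input layer.

Activation Functions: ReLU in the generator for all layers except the output, which uses Tanh; LeakyReLU in the discriminator for all layers.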
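To make the "strided convolution instead of pooling" constraint concrete, here is a minimal sketch of the spatial-size arithmetic on the discriminator side. The kernel size 4, stride 2 and padding 1 are assumptions borrowed from common re-implementations, not values given in these notes, and the helper names are hypothetical.

```python
def conv_out_size(size, kernel, stride, padding):
    """Spatial output size of a standard (strided) convolution."""
    return (size + 2 * padding - kernel) // stride + 1


def downsample_sizes(size, num_layers, kernel=4, stride=2, padding=1):
    """Sizes after each strided convolution in a stack.

    kernel/stride/padding defaults are assumed, not taken from the paper.
    """
    sizes = [size]
    for _ in range(num_layers):
        size = conv_out_size(size, kernel, stride, padding)
        sizes.append(size)
    return sizes


# A 64x64 input halves at every layer, with no pooling layer involved.
discriminator_sizes = downsample_sizes(64, num_layers=4)
print(discriminator_sizes)  # [64, 32, 16, 8, 4]
```

Each stride-2 convolution halves the resolution just as a 2x2 max pooling would, but the downsampling weights are learned.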

NorbertZheng commented 1 year ago

Network Architecture

Generator

image
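Since the figure is not reproduced here, the following sketch walks through the generator's feature-map shapes as I recall them from the paper's generator figure: a 100-dimensional z projected to 4x4x1024, then upsampled to 64x64x3. The kernel size 4, stride 2 and padding 1 are assumptions taken from common re-implementations, and the function names are hypothetical.

```python
def transposed_conv_out_size(size, kernel, stride, padding, output_padding=0):
    """Spatial output size of a fractionally-strided (transposed) convolution."""
    return (size - 1) * stride - 2 * padding + kernel + output_padding


def generator_shapes(base_size=4, channels=(1024, 512, 256, 128, 3),
                     kernel=4, stride=2, padding=1):
    """(channels, height, width) after the projection and each upsampling layer.

    kernel/stride/padding are assumed values, not stated in the notes.
    """
    shapes = [(channels[0], base_size, base_size)]
    size = base_size
    for ch in channels[1:]:
        size = transposed_conv_out_size(size, kernel, stride, padding)
        shapes.append((ch, size, size))
    return shapes


for shape in generator_shapes():
    print(shape)
# (1024, 4, 4)
# (512, 8, 8)
# (256, 16, 16)
# (128, 32, 32)
# (3, 64, 64)
```

Each layer doubles the spatial resolution while the channel count shrinks, ending in a 3-channel 64x64 image.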

NorbertZheng commented 1 year ago

Experimental Results

NorbertZheng commented 1 year ago

DCGAN Trained on ImageNet, Tested on CIFAR10 & SVHN

image Accuracy (%) on CIFAR10.

NorbertZheng commented 1 year ago

Walking in the Latent Space

image Interpolation between a series of 9 random points in Z shows that the learned space has smooth transitions.
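A latent-space walk of this kind can be sketched as piecewise linear interpolation between random points. This is only an illustration: the 100-dimensional uniform prior, the number of steps per segment and the function names are assumptions, and a trained generator (not shown) would be needed to turn each point into an image.

```python
import random


def lerp(z_a, z_b, t):
    """Linear interpolation between two latent vectors, t in [0, 1]."""
    return [(1.0 - t) * a + t * b for a, b in zip(z_a, z_b)]


def interpolate_path(points, steps_per_segment):
    """Walk through the anchor points in order, ending on the last anchor."""
    path = []
    for z_a, z_b in zip(points, points[1:]):
        for i in range(steps_per_segment):
            path.append(lerp(z_a, z_b, i / steps_per_segment))
    path.append(list(points[-1]))
    return path


rng = random.Random(0)
latent_dim = 100  # assumed latent size
anchors = [[rng.uniform(-1.0, 1.0) for _ in range(latent_dim)] for _ in range(9)]
path = interpolate_path(anchors, steps_per_segment=10)
print(len(path))  # 81 latent vectors: 8 segments x 10 steps + the final anchor
```

If images generated along `path` change gradually rather than jumping, that suggests the model has learned a smooth representation instead of memorizing samples.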

NorbertZheng commented 1 year ago

Visualizing the Discriminator Features

image Left: Random Filter Baseline, Right: Trained Filters.

NorbertZheng commented 1 year ago

Forgetting to Draw Certain Objects

image First row: samples from the unmodified model. Second row: samples generated after dropping the “window” filters.
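The filter-dropping step can be sketched as zeroing whole feature maps. As far as I recall, the paper selects maps by fitting a logistic regression that predicts "window" locations and dropping maps with positive weight; the sketch below assumes those weights are already available, and the names and toy data are hypothetical.

```python
def drop_feature_maps(feature_maps, probe_weights):
    """Zero every feature map whose probe weight is positive.

    feature_maps: list of 2-D lists (one per channel).
    probe_weights: one weight per channel, assumed to come from a linear
    probe trained to detect the unwanted object (not shown here).
    """
    result = []
    for fmap, weight in zip(feature_maps, probe_weights):
        if weight > 0:
            # Drop this channel at all spatial locations.
            result.append([[0.0] * len(row) for row in fmap])
        else:
            result.append([list(row) for row in fmap])
    return result


toy_maps = [
    [[0.5, 0.1], [0.3, 0.9]],  # channel 0
    [[0.2, 0.4], [0.6, 0.8]],  # channel 1
]
toy_weights = [0.7, -0.2]  # channel 0 counts as a "window" filter
print(drop_feature_maps(toy_maps, toy_weights))
# [[[0.0, 0.0], [0.0, 0.0]], [[0.2, 0.4], [0.6, 0.8]]]
```

The remaining layers of the generator then run on the masked activations, which is why the outputs can lose the object while the rest of the scene stays similar.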

NorbertZheng commented 1 year ago

Vector Arithmetic on Face Samples

image Smiling Woman - Neutral Woman + Neutral Man = Smiling Man.

image Man With Glasses - Man Without Glasses + Woman Without Glasses = Woman With Glasses.

image A “turn” vector was created from four averaged samples of faces looking left vs. looking right.

image Vector Arithmetic on Input Space.
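The arithmetic in the captions above operates on latent vectors, not pixels. Below is a minimal sketch, assuming each concept is represented by the mean of a few exemplar z vectors (as far as I recall, the paper averages three exemplars per concept because single samples were unstable). The 2-D toy vectors and function names are purely illustrative.

```python
def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def concept_arithmetic(a, b, c):
    """Return mean(a) - mean(b) + mean(c) for three lists of exemplars."""
    mean_a, mean_b, mean_c = mean_vector(a), mean_vector(b), mean_vector(c)
    return [x - y + z for x, y, z in zip(mean_a, mean_b, mean_c)]


# Toy 2-D latent space: axis 0 ~ "woman", axis 1 ~ "smile" (hypothetical).
smiling_woman = [[1.0, 0.5], [1.0, 1.0], [1.0, 1.5]]
neutral_woman = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]]
neutral_man = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]

smiling_man = concept_arithmetic(smiling_woman, neutral_woman, neutral_man)
print(smiling_man)  # [0.0, 1.0]
```

In the real setting the resulting vector would be fed to the generator; doing the same arithmetic directly on pixels gives blurry overlays, which is the contrast the last figure makes.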

NorbertZheng commented 1 year ago

Reference

[2016 ICLR] [DCGAN] Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks.

NorbertZheng commented 1 year ago

Generative Adversarial Network

[GAN] [CGAN] [LAPGAN] [DCGAN]