Discriminator Activation

Actually we use Fully Convolutional Networks in Discriminator, maybe you have noticed that before we use avg_pool we have this code: model += [nn.Conv2d(512, 1, 4, padding=1)] after this the output channel is 13030, and next we use code return F.avg_pool2d(x, x.size()[2:]).view(x.size()[0], -1), then return channel shape is 1*1, so there is no need to use sigmoid function. We use the average pooling method instead of the fully connection layer can reduce the network size, and it helps to reduce over-fitting.

aitorzip / PyTorch-CycleGAN

Discriminator Activation #8