water ripple artifacts when using CoBi Loss

S-aiueo32 / contextual_loss_pytorch

Contextual Loss (CX) and Contextual Bilateral Loss (CoBi).

MIT License

172 stars 45 forks source link

water ripple artifacts when using CoBi Loss #3

Open conson0214 opened 4 years ago

conson0214 commented 4 years ago

when i training ESRGAN using Contextual Bilateral Loss, without l1/perceptual/GAN loss all the inference results seem to have artifacts like water ripple in smooth areas Do you have any idea how might this artifact come about?

S-aiueo32 commented 4 years ago

@conson0214 I think it will be caused by using only CoBi{VGG} in Eq. (3). Even though it causes over smoothing, the pixel wise loss has very important roles In SR problems. So I suggest to use CoBi{RGB}.

conson0214 commented 4 years ago

the loss function in my traning codes as following, i use CoBi_{RGB} as a part of total loss, but i`m wondering if I define it properly?

import contextual_loss as cl

criterion_rgb = cl.ContextualBilateralLoss(use_vgg=False, loss_type='cosine').to(self.device) l_cobirgb = criterion_rgb(self.fake_H, self.real_H)

criterion_relu1_2 = cl.ContextualBilateralLoss(use_vgg=True, loss_type='cosine', vgg_layer='relu1_2').to(self.device) l_cobirelu1_2 = criterion_relu1_2(self.fake_H, self.real_H)

criterion_relu2_2 = cl.ContextualBilateralLoss(use_vgg=True, loss_type='cosine', vgg_layer='relu2_2').to(self.device) l_cobirelu2_2 = criterion_relu2_2(self.fake_H, self.real_H)

criterion_relu3_4 = cl.ContextualBilateralLoss(use_vgg=True, loss_type='cosine', vgg_layer='relu3_4').to(self.device) l_cobirelu3_4 = criterion_relu3_4(self.fake_H, self.real_H)

l_total = l_cobirgb + l_cobirelu1_2 + l_cobirelu2_2 + 0.5*l_cobirelu3_4

S-aiueo32 commented 4 years ago

The paper uses n×n RGB patches as features for CoBi_RGB. Your code compares single RGB value only.

conson0214 commented 4 years ago

How to calculate CoBi_RGB using nxn RGB patches as features? I`m confused when reading this part of paper. like this 1/wh Σ{criterion_rgb(self.fake_H(nxn), self.real_H(nxn))}? calculate CoBi_RGB per pixel in its nxn neighbourhood, then average them in whole image? Am I right?

S-aiueo32 commented 4 years ago

It probably means the input of CoBi_RGB should be the vectors of n×n RGB values. Currently, this package does not support the feature conversion so you need to define it outside of the package.

For example:

# dummy image, shape: (n, c, h, w)
img = torch.rand(n, c, h, w)

# sample patches, shape: (n, c, kernel_size, kernel_size, n_patches)
patches = sample_patches(
    img, kernel_size=3, stride=2, padding=0)

# convert to vectors, shape: (n, c*kernel_size*kernel_size, n_patches, 1)
vectors = patches.reshape(n, -1, n_patches, 1)

criterion = ContextualBilateralLoss()
loss = criterion(vectors, vectors)