hzlsaber / IPMix

The official repository of "IPMix: Label-Preserving Data Augmentation Method for Training Robust Classifiers"
MIT License

Reason behind KL divergence #7

Closed: khawar-islam closed this issue 7 months ago

khawar-islam commented 7 months ago

Dear @hzlsaber

In the training loop, you clamp the mixture distribution before computing the KL divergence, with the comment `# Clamp mixture distribution to avoid exploding KL divergence`. What is the reason behind this, and what is its benefit? The PixMix paper does not do this, which is why I am asking; there are few methods that utilize fractal images.

Also, are you using both the clean images and the augmented images for training (`images_all = torch.cat(images, 0).cuda()`)? Usually, when we apply augmentation, we train on the augmented data instead of the original training data.

Regards, Khawar

hzlsaber commented 7 months ago

Thank you for reaching out and sorry for the delayed reply.

  1. IPMix employs a JS-divergence consistency loss to enhance model performance (see the sketch after this list). For additional insights, I suggest consulting AugMix [1].

  2. Indeed: by training the model on images augmented with IPMix, we are able to develop more robust models.
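
For completeness, here is a minimal sketch of that loss, assuming the standard AugMix-style formulation (function and variable names are illustrative, not necessarily the ones used in this repository). The JS divergence is computed as the average KL divergence of each prediction against their mixture, so the mixture is clamped before taking its log to keep the KL terms finite when a probability underflows to zero:

```python
import torch
import torch.nn.functional as F

def jsd_consistency_loss(logits_clean, logits_aug1, logits_aug2):
    """AugMix/IPMix-style Jensen-Shannon consistency loss (illustrative sketch)."""
    p_clean = F.softmax(logits_clean, dim=1)
    p_aug1 = F.softmax(logits_aug1, dim=1)
    p_aug2 = F.softmax(logits_aug2, dim=1)

    # Clamp the mixture distribution to avoid exploding KL divergence:
    # without the clamp, log(m) tends to -inf when a mixture probability underflows.
    p_mixture = torch.clamp((p_clean + p_aug1 + p_aug2) / 3.0, 1e-7, 1).log()

    # JSD = mean KL(p_i || m) over the clean and two augmented predictions.
    return (F.kl_div(p_mixture, p_clean, reduction='batchmean') +
            F.kl_div(p_mixture, p_aug1, reduction='batchmean') +
            F.kl_div(p_mixture, p_aug2, reduction='batchmean')) / 3.0
```

In this kind of pipeline, the clean image and its augmented views are concatenated (e.g. `images_all = torch.cat(images, 0).cuda()`), passed through the model in a single forward pass, and the logits are split back into three chunks; the clean logits also feed the usual cross-entropy term, so both clean and augmented data contribute to training.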

Best,

[1] Hendrycks, Dan, et al. "AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty." arXiv preprint arXiv:1912.02781 (2019).