Question about Augmentation

NVlabs / edm

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Other

1.37k stars 143 forks source link

It appears the score network is also conditioned on the augmentation parameters during training, which gets mapped into a embedding just like the noise (in fact they are summed together). So it can be thought of as training a score network over an ensemble of different data distributions. It looks like at generation time the augmentation parameters passed to the network are all zeros.

My educated guess is that it may still be possible for the augmentations to leak into the generation, it depends on to what extent the network has learned that the zero vector indeed corresponds to the real data distribution.

NVlabs / edm

Question about Augmentation #17