bdy9527 closed 4 years ago
In the consistency regularization
Here the L2 norm can be replaced by the KL divergence. We adopt the L2 norm because it achieved better performance in our experiments.
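To make the two options concrete, here is a minimal sketch of both penalties. It assumes the regularizer compares two predicted class distributions for the same node; the exact form of the term is not shown in this thread, so the function names `l2_consistency` and `kl_consistency` and the example logits are hypothetical, not the authors' code.

```python
import math

def softmax(logits):
    """Turn raw scores into a probability vector (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def l2_consistency(p, q):
    """Squared L2 distance between two probability vectors (hypothetical helper)."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kl_consistency(p, q, eps=1e-12):
    """KL(p || q); eps guards against log(0) (hypothetical helper)."""
    return sum(a * math.log((a + eps) / (b + eps)) for a, b in zip(p, q))

# Two made-up predictions for the same node, e.g. from two augmented views.
p = softmax([2.0, 0.5, -1.0])
q = softmax([1.5, 0.7, -0.8])

l2 = l2_consistency(p, q)
kl = kl_consistency(p, q)
print(f"L2 penalty: {l2:.4f}, KL penalty: {kl:.4f}")
```

Both penalties are zero when the two predictions agree and grow as they diverge. The squared L2 distance is symmetric and bounded, while KL is asymmetric and can grow without bound when `q` assigns near-zero probability to a class that `p` favors. That difference may be one reason the two behave differently in practice, though the thread does not say so.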