Closed shixiao1997 closed 4 years ago
This is because we want to simulate different incorrect configurations that the network might see. If you could get good results without data augmentation, it probably means the training data itself cover sufficient variations already.
hello, this great job inspired me a lot, but i have a question about the code. Why add perturbation during training? When I implement stn by myself, it also work normally without him.