Rotating images before train/test data split

Hello.

In the manuscript, it is noted that This dataset was enhanced 10-fold by rotating the images, to make up the final dataset (37000 images).

However, this is done before the train/test split is made! Is it therefore fair to say that it is very likely that the test dataset simply contains rotated versions of the training data?

If this is true, it is of course problematic because although U-Net is not rotationally invariant, it is not fair to say that the test data is really unseen data. For example, see the discussion here: https://stats.stackexchange.com/questions/412992/data-augmentation-on-entire-dataset-before-splitting

wl-stepp / adaptive-imaging

Rotating images before train/test data split #2