Closed cytotoxicity8 closed 2 years ago
I've tried Horizontal Flip, VerticalFlip, and random rotation, but performance drops across multiple categories.
Yes it is true. The authors also mentioned it. I think the reason is based on the symmetry of many categories. However the authors didn't let us know categories that they applied data augmentation. So I guess data augmentation should be an option of the code.
Nice catch @cytotoxicity8 . I didn't perform data augmentation because the paper didn't give enough details on what kind of augmentation on which categories. However, the paper shows that performance without augmentation doesn't have a significant drop. So I just omitted it
As you can see in "6.2 Training Data Augmentation", the authors applied some data augmentation methods. Even though they introduced the results without data augmentation methods, table 2 seems to be with the methods. (Compare AUC in the table 8.) So I think the dataset.py should be changed like below: (*I didn't apply random rotation.)