Closed LiYuhangUSTC closed 8 months ago
Hi yuhang, for 1. I personally tried to add ColorJitter but found it would hurt performance. Actually MAE, SimMIM and more related masked image modeling work also only use such minimal augmentations (flip & randresizedcrop). I think the learning of SparK (also MAE or SimMIM) is to model the pixel distribution itself (like learning some pixel correlation or dependency). So any distortion or inappropriate transformation to the distribution (like ColorJitter) can mislead the learning and thus be harmful.
For 2, I would only consider adding any extra augmentation if: i. the data is extremely insufficient ii. the augmentation is quite natural and reasonable for the target image distribution: it'll always produce in-distribution augmented images, rather than adding noise to the data distribution.
Thank you for the quick and valuable reply! I agree that maintaining the data distribution is important in the pretraining stage. Should be more careful when selecting augmentations.
Hi, thanks for your work. What a great job!
I am trying to use SparK to get a pretrained model for my target task. I have a few questions.