Data Argumentation for Training

Hi, thank you for your interest in our work. We indeed choose to cluster the representations from the center crop of the images. Then we use stronger data augmentation during training: https://github.com/facebookresearch/deepcluster/blob/9796a71abbfd14181a2b117d6244e60c2d94efbf/clustering.py#L142 Indeed, we consider that each augmented version of an image belongs to the cluster of its center crop. Actually, data augmentation during training is crucial for the method to work well and I think that using stronger data augmentation might improve furthermore the features quality. We didn't experiment on data augmentation for the clustering step though. Please re-open the issue if you have further questions.

facebookresearch / deepcluster

Data Argumentation for Training #52