vturrisi / solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
MIT License
1.38k stars 181 forks source link

Train/Test Split for Pretraining via imagenet #361

Closed LightingMc closed 11 months ago

LightingMc commented 11 months ago

What is the train-test split used for the pre-training on ImageNet? Do you have a file that describes which image goes where? What is the split percentage?

vturrisi commented 11 months ago

It's the default train/val splits for imagenet.

LightingMc commented 11 months ago

Where can I find details of that?

vturrisi commented 11 months ago

It already comes divided into train and val. You can download it from multiple sources, e.g. https://huggingface.co/datasets/imagenet-1k/tree/main