facebookresearch / ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
Other
2.83k stars 358 forks source link

config files about vit-small and vit-base #39

Open stonewjf opened 1 year ago

stonewjf commented 1 year ago

can you tell me vit-small and vit-base training configuration

paritosh-101 commented 8 months ago

My config file for vit_small. Adjust the patch size according to the image dimensions:

data: batch_size: 128 color_jitter_strength: 0.0 crop_scale:

Oguzhanercan commented 7 months ago

when I change the crop_size, an error occurs about masks, can you share your mask code?

paritosh-101 commented 7 months ago

I did not make any changes to the mask code, just adjusted the patch size according to the crop size (which is equal to the img size in my case). You can see this file (scroll down to the bottom): "src/models/vision_transformer.py", to get a better idea of adjustments.