In the Masked Pretraining section, there seems to be an issue with the way the CLIP model is loaded. In the extract.ipynb notebook, the code model, _ = clip.load("ViT-B/16", device='cpu') is used, but the CLIP code repository does not have a load module, and the import import clip.clip as clip does not seem to be the correct way to do it.
In the Masked Pretraining section, there seems to be an issue with the way the CLIP model is loaded. In the extract.ipynb notebook, the code model, _ = clip.load("ViT-B/16", device='cpu') is used, but the CLIP code repository does not have a load module, and the import import clip.clip as clip does not seem to be the correct way to do it.