ChenDelong1999 / RemoteCLIP

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
https://arxiv.org/abs/2306.11029
Apache License 2.0
227 stars 13 forks source link

CLIP Format #32

Open Luo-Z13 opened 4 days ago

Luo-Z13 commented 4 days ago

Could you please guide me on how to convert a ".pt" format model into the Hugging Face format (similar to the one at https://huggingface.co/laion/CLIP-ViT-g-14-laion2B-s12B-b42K/tree/main? It seems to require a JSON file to load the model.

Thank you!

Luo-Z13 commented 4 days ago

Specifically, It seems the .pt format can not be loaded via the function CLIPVisionConfig.from_pretrained.