wusize / CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
https://arxiv.org/abs/2310.01403

Generating text embedding files #8

Closed yhosoya66 closed 4 months ago

yhosoya66 commented 4 months ago

Hi, thank you for your excellent work and the well-organized code you've shared. I really appreciate it!

I'd like to ask a few questions, if that's okay.

I'm interested in fine-tuning F-ViT from CLIPSelf (available in this repository) on a different dataset. For this purpose, I need to create embedding files like 'datasets/embeddings/coco_with_background_evaclip_vitb_16.pt', which are essentially text embeddings for the target dataset's categories, right?

Here are my questions:

  1. How can I create these embedding files for my own dataset? Could you provide some guidance or a script for generating text embedding files for an arbitrary dataset?
  2. Is the same text encoder used consistently across all configuration settings? If so, could you tell me which model you used?

Thanks for your help.

wusize commented 4 months ago

Hi, please use this script to generate text embeddings. Make sure to use the text encoder that corresponds to the ViT model.
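
For reference, the generation step looks roughly like the sketch below. This is a minimal, hedged example using open_clip rather than the repository's own script; the model/pretrained names, the prompt template, and the appended zero "background" row are assumptions that should be matched to your checkpoint and dataset.

```python
import torch
import open_clip

# Class names of the target dataset (hypothetical example list).
categories = ["person", "bicycle", "car"]

# "ViT-B-16"/"openai" is only a placeholder pairing that exists in open_clip;
# swap in the text encoder that matches the ViT you fine-tune (e.g. an EVA-CLIP variant).
model_name = "ViT-B-16"
model, _, _ = open_clip.create_model_and_transforms(model_name, pretrained="openai")
tokenizer = open_clip.get_tokenizer(model_name)
model.eval()

with torch.no_grad():
    # Single prompt template; the repository's script may use a richer prompt ensemble.
    tokens = tokenizer([f"a photo of a {c}" for c in categories])
    text_embeds = model.encode_text(tokens)
    text_embeds = text_embeds / text_embeds.norm(dim=-1, keepdim=True)

# Append an all-zero row as a "background" embedding, mirroring the
# *_with_background_* naming of the released files (format assumption).
background = torch.zeros(1, text_embeds.shape[-1])
torch.save(torch.cat([text_embeds, background], dim=0),
           "my_dataset_with_background_vitb_16.pt")
```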

yhosoya66 commented 4 months ago

Thank you for your prompt reply. It works when I set the '--cache_dir' argument to the target pre-trained weights.