LCFractal / TGDT

Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
MIT License
21 stars 2 forks source link