PKU-YuanGroup / LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
https://arxiv.org/abs/2310.01852
MIT License
549 stars, 44 forks

Inquiry on Unimodal Fine-Tuning with Locked Image in LanguageBind #41

Closed by hexinyi2101 4 days ago