deepglint / RWKV-CLIP

[EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner
MIT License
115 stars 8 forks source link