THU-MIG / RepViT

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything
https://arxiv.org/abs/2307.09283
Apache License 2.0
681 stars 55 forks source link

need clip pretrain 👅👅👅👅 #26

Closed ZHEQIUSHUI closed 7 months ago

jameslahm commented 7 months ago

Thanks for your interest. Do you mean the mask decoder supporting text prompt?

ZHEQIUSHUI commented 7 months ago

just like openai/CLIP 😘

jameslahm commented 7 months ago

Thanks. This project is not so relevant to CLIP.