Ucas-HaoranWei / Vary

[ECCV2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
1.65k stars 150 forks source link

关于demo运行问题 #94

Open whm233 opened 3 months ago

whm233 commented 3 months ago

请问运行的时候除了需要下载clip-vit-large-patch14这个模型外还需要作者训练的千问模型是吗?

Ucas-HaoranWei commented 3 months ago

是的