alipay / Ant-Multi-Modal-Framework

Research code from the Multimodal-Cognition Team at Ant Group
Creative Commons Attribution 4.0 International
60 stars, 2 forks

Add Pink Implementation #10

Closed: LandyGuo closed this 2 months ago

LandyGuo commented 2 months ago

Implementation of our CVPR 2024 paper, "Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs".