CircleRadon / TokenPacker

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".

really like mini-gemini #9

Closed: Birdylx closed this issue 1 month ago

LiWentomng commented 1 month ago

@Birdylx Thank you for your interest. We acknowledge that TokenPacker shares similar insights with the dual-encoder interaction module in Mini-Gemini; however, our method diverges in the details, particularly in the implementation. The motivation and framework of our work also differ from those of Mini-Gemini. Furthermore, our method achieved better performance with fewer tokens than Mini-Gemini-HD.