BradyFU / Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
11.59k stars 750 forks source link

Integration Request: 'Honeybee: Locality-enhanced Projector for Multimodal LLM' #95

Closed khanrc closed 8 months ago

khanrc commented 9 months ago

Hello,

Thank you for this great project!

Could you add our recent work, "Honeybee: Locality-enhanced Projector for Multimodal LLM"? We used MME as an evaluation benchmark in this work, thanks! :)

xjtupanda commented 8 months ago

Thanks for sharing! We have added the paper to our repo. Please also consider citing our survey paper, which tracks the frontier of MLLMs:

@article{yin2023survey,
  title={A Survey on Multimodal Large Language Models},
  author={Yin, Shukang and Fu, Chaoyou and Zhao, Sirui and Li, Ke and Sun, Xing and Xu, Tong and Chen, Enhong},
  journal={arXiv preprint arXiv:2306.13549},
  year={2023}
}
khanrc commented 8 months ago

Thanks!