BradyFU / Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

Add Ovis: Structural Embedding Alignment for Multimodal Large Language Model #157

Open: runninglsy opened this issue 1 month ago

runninglsy commented 1 month ago

Thank you for your efforts in maintaining this repository. We would like to propose the addition of our recent paper (https://arxiv.org/pdf/2405.20797) on multimodal large language models to the list of references.

xjtupanda commented 1 month ago

Thanks for sharing. The work has been added to the repo. Please also consider citing the survey paper:

@article{yin2023survey,
  title={A Survey on Multimodal Large Language Models},
  author={Yin, Shukang and Fu, Chaoyou and Zhao, Sirui and Li, Ke and Sun, Xing and Xu, Tong and Chen, Enhong},
  journal={arXiv preprint arXiv:2306.13549},
  year={2023}
}

runninglsy commented 1 month ago

Thanks for the reference. We have open-sourced our work on GitHub and cited the excellent survey paper in the new arXiv version. Please consider updating the GitHub link as indicated in the new commit: https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/pull/157/commits/699e908b522b4cb4d53cffd39ab7775a322da07e

xjtupanda commented 1 month ago

Sure. We've updated the corresponding item.