Open gordonhu608 opened 1 month ago
Thanks for sharing. Your work has been added to the repo. Please also consider citing our survey paper:
@article{yin2023survey,
title={A Survey on Multimodal Large Language Models},
author={Yin, Shukang and Fu, Chaoyou and Zhao, Sirui and Li, Ke and Sun, Xing and Xu, Tong and Chen, Enhong},
journal={arXiv preprint arXiv:2306.13549},
year={2023}
}
Thanks so much, we are updating arXiv papers and we will definitely cite your survey !!!!
A new awesome work comes out! MQT-LLaVA: Matryoshka Query Transformer for Large Vision-Language Models. https://github.com/gordonhu608/MQT-LLaVA