COMM (which has already been on Arxiv on 2023.10 https://arxiv.org/pdf/2310.08825.pdf) already proposed to merge the features of CLIP and DINOv2 to realize MLLM, maybe this paper should cite this reference. #4
Many thanks for your excellent work!
COMM (which has already been on Arxiv on 2023.10 https://arxiv.org/pdf/2310.08825.pdf) already proposed to merge the features of CLIP and DINOv2 to realize MLLM, maybe this paper should cite this reference.
Thank you for your insightful suggestion and valuable feedback!
We will cite this important reference in the related work section of our future versions.
Many thanks for your excellent work! COMM (which has already been on Arxiv on 2023.10 https://arxiv.org/pdf/2310.08825.pdf) already proposed to merge the features of CLIP and DINOv2 to realize MLLM, maybe this paper should cite this reference.