COMM (which has already been on Arxiv on 2023.10 https://arxiv.org/pdf/2310.08825.pdf) already proposed to merge the features of CLIP and DINOv2 to realize MLLM, maybe this paper should cite this reference. #18
COMM (which has already been on Arxiv on 2023.10 https://arxiv.org/pdf/2310.08825.pdf) already proposed to merge the features of CLIP and DINOv2 to realize MLLM, maybe this paper should cite this reference. Many thanks for your excellent work!
COMM (which has already been on Arxiv on 2023.10 https://arxiv.org/pdf/2310.08825.pdf) already proposed to merge the features of CLIP and DINOv2 to realize MLLM, maybe this paper should cite this reference. Many thanks for your excellent work!