OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
https://internvl.readthedocs.io/en/latest/
MIT License
6.15k stars 478 forks source link

[Docs] For intergrading #716

Open Vital1162 opened 1 week ago

Vital1162 commented 1 week ago

📚 The doc issue

Is there any tutor for integrating the vision model with the language model?

Suggest a potential alternative/fix

No response

czczup commented 1 week ago

Hello, are you referring to using a vision model and a language model to build an MLLM?

Vital1162 commented 6 days ago

I'm impressed by InternVL and would like to have a tutorial/documentation on how you combine these models (vision model + MLP + LLMs) together so that they can be more accessible to newbies like me. Thank