InternLM / InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
1.91k stars 120 forks source link

求发布一个InternLM-XComposer2-4KHD的gradio_demo_composition.py #294

Closed a136214808 closed 2 months ago

a136214808 commented 2 months ago

求发布一个InternLM-XComposer2-4KHD的gradio_demo_composition.py

panzhang0212 commented 2 months ago

The 4KHD model is designed for multimodal understanding. So it does not support for composition.

We release the gradio code to chat with InternLM-XComposer2-4KHD. Please refer to

https://github.com/InternLM/InternLM-XComposer/blob/main/examples/gradio_4khd_chat.py

yuhangzang commented 2 months ago

Kindly re-open if you still have any issues.