Closed sky-fly97 closed 2 months ago
Hi, @sky-fly97 , our released models are currently not finetuned or evaluated with multi-page samples (mPLUG-PaperOwl could support understanding of multiple diagram images but are not scheduled for release recently). Honestly, we're not quite sure whether these models could handle multi-page input. You can try inference with our docowl1.5 model.
Thanks, I will try. By the way, it seems that there are very few models on the market that can handle multi-page input, I've only seen qwen-vl-chat and GPT4V so far.
Thanks, I will try. By the way, it seems that there are very few models on the market that can handle multi-page input, I've only seen qwen-vl-chat and GPT4V so far.
Yes,there is still a lack of effective open-source methods for multi-image understanding.
Could you tell me which model in this series can support multiple-page inputs?