X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Apache License 2.0
1.63k stars 101 forks source link
chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding

The Powerful Multi-modal LLM Family for OCR-free Document Understanding

Alibaba Group

DocOwl | Trendshift

πŸ“’ News

πŸ€– Models

πŸ“Ί Online Demo

Note: The demo of HuggingFace is not as stable as ModelScope because the GPU in ZeroGPU Spaces of HuggingFace is dynamically assigned.

πŸ“– DocOwl 1.5

πŸ“ˆ TinyChart-3B

🌰 Cases

images

Related Projects