X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Apache License 2.0
1.12k stars 68 forks source link
chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding

The Powerful Multi-modal LLM Family for OCR-free Document Understanding

Alibaba Group

DocOwl | Trendshift

📢 News

🤖 Models

📺 Online Demo

Note: The demo of HuggingFace is not as stable as ModelScope because the GPU in ZeroGPU Spaces of HuggingFace is dynamically assigned.

📖 DocOwl 1.5

📈 TinyChart-3B

🌰 Cases

images

Related Projects