Yuliang-Liu / Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
MIT License
1.82k stars 128 forks

Training data #78

Closed (luohao123 closed this issue 6 months ago)

luohao123 commented 6 months ago

Will the Chinese OCR training data be released?

echo840 commented 6 months ago

We have released the training JSON for TextMonkey. You can find it at https://www.modelscope.cn/datasets/lvskiller/TextMonkey_data/summary . Note that TextMonkey was not trained on Chinese data.