Closed lucasjinreal closed 9 months ago
We expect to release all details in the following days. Meanwhile, please refer to our paper for more information. https://arxiv.org/abs/2402.14289
We have updated our README on data preparation
@baichuanzhou thanks, Do u think there any good data to enhance OCR ability? Currently I found the OCR ability especially Chinese are very weak.
@lucasjinreal Emmm, I think increasing data and increasing resolutions are both important to improve OCR abilities. As for training data, maybe look at ChartQA, DVQA, etc? Anyway, further exploration in this area is needed.
@baichuanzhou How to enlarge the vit input size? since if make the size changed, the weights should not properly work
Opensource community would be benefit from it.