PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/
Apache License 2.0
44.48k stars 7.84k forks source link

版面分析如何开始训练自己的数据 #9262

Closed AIwang666 closed 1 year ago

AIwang666 commented 1 year ago

RT O9(ANS(H$N2G_%C%J7(}J~T (1)请问进行版面分析训练 想要重新训练自己的数据的话是要下载上图中的训练模型嘛,以CDLA为例这个训练模型能不能不局限于CDLA的数据集场景 可以在自己场景的数据进行训练后得到一定的效果吗 (2)还是说CDLA的训练模型只能处理该类型数据集的数据

KyleWang-Hunter commented 1 year ago

RT O9(ANS(H$N2G_%C%J7(}J~T (1)请问进行版面分析训练 想要重新训练自己的数据的话是要下载上图中的训练模型嘛,以CDLA为例这个训练模型能不能不局限于CDLA的数据集场景 可以在自己场景的数据进行训练后得到一定的效果吗 (2)还是说CDLA的训练模型只能处理该类型数据集的数据

可以在预训练模型的基础上对自己的数据集进行fineturn

github-actions[bot] commented 1 year ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.