PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Apache License 2.0
38.99k stars 7.32k forks source link

繁体模型有更新计划么 #11991

Closed TOMORINAONAO closed 2 weeks ago

TOMORINAONAO commented 3 weeks ago

1.目前测试v3的繁体模型效果是比较差的,看到多语言模型里日文和韩文都有v4版的,繁体模型什么时候有新的? 2.我测试时是网上复制一些新闻内容或公文,然后转成繁体字,在word里改变了几种字体测试。发现不同字体的识别效果都不太一样,标点符号漏的也很多,识别的准确率也不是很高,大概是70多的准确率。

Sunting78 commented 3 weeks ago

您好,如果有高的精度要求可以使用真实数据finetune一下