中文模型 - Githubissues

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

https://paddlepaddle.github.io/PaddleOCR/

Apache License 2.0

43.9k stars 7.8k forks source link

中文模型 #69

Closed yyr6661 closed 4 years ago

yyr6661 commented 4 years ago

感谢大佬分享，太强了请问现在提供的中文模型只有inference.md中的两种吗？

LDOUBLEV commented 4 years ago

感谢关注，目前中文模型就inference.md中提到的了。这个模型是针对通用OCR识别，针对一些特殊场景的OCR，可以用这个模型去fine-tune

yyr6661 commented 4 years ago

inference.md中是db+crnn，其它算法想要得到中文模型只能自己训练了是吗？

LDOUBLEV commented 4 years ago

对，暂时只能自己训练了，后续我们也会推出其他中文模型，可以保持关注

yyr6661 commented 4 years ago

非常感谢！

DuckJ commented 4 years ago

@LDOUBLEV 在体验你们的在线OCR服务时，识别过程中，中英混杂的时候，两个连续的英文单词在一起会导致空格识别不出来，请问这个问题有什么好的思路解决吗。如果是标记空格的话，数据集中存在的数据已经没有标记空格了，再去标也不现实