PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/
Apache License 2.0
44.65k stars 7.86k forks source link

训练外语模型,无法识别空格 #13990

Open mxdg opened 1 month ago

mxdg commented 1 month ago

🔎 Search before asking

🐛 Bug (问题描述)

基于ch_PP-OCRv4_rec_hgnet.yml训练的新语言ocr识别模型,设置了 use_space_char: true 词典加了空格字符,训练图片也有空格 但是训练出来的模型,却无法识别空格,不知道哪里设置错了

🏃‍♂️ Environment (运行环境)

python 3.10

🌰 Minimal Reproducible Example (最小可复现问题的Demo)

python 3.10

Liyulingyue commented 1 month ago

可以提供一下示例图片吗?

XiaokangLei commented 2 weeks ago

🔎 Search before asking

  • [x] I have searched the PaddleOCR Docs and found no similar bug report.
  • [x] I have searched the PaddleOCR Issues and found no similar bug report.
  • [x] I have searched the PaddleOCR Discussions and found no similar bug report.

🐛 Bug (问题描述)

基于ch_PP-OCRv4_rec_hgnet.yml训练的新语言ocr识别模型,设置了 use_space_char: true 词典加了空格字符,训练图片也有空格 但是训练出来的模型,却无法识别空格,不知道哪里设置错了

🏃‍♂️ Environment (运行环境)

python 3.10

🌰 Minimal Reproducible Example (最小可复现问题的Demo)

python 3.10

字典不加空格字符