Open KaraElan opened 1 month ago
Latin letters are included because they still appear in some scenes. Regarding the problem you mentioned, maybe you need to regenerate a dictionary containing only cyrillic characters to retrain the model. Or delete the Latin letters in the recognition results (but this is not an effective method, I think)
As far as I understand, currently it is a bug of Cyrillic-based languages recognition.
Discussed in https://github.com/PaddlePaddle/PaddleOCR/discussions/13309