Closed sleepybear1113 closed 6 months ago
@sleepybear1113
For our test cases, we set TESSDATA_PREFIX
environment variable to various values: D:\Test\tessdata-á
, D:\Test\tessdata-â
, and D:\Test\tessdata-ấ
, and run tesseract --list-langs
command for each. It worked with the first two cases, which use extended ASCII characters, but not with the last one, which contains a Unicode character. Tesseract engine apparently does not support Unicode characters in tessdata
path.
Duplicate of Issue #190
I would like to report an issue regarding setting a non-English datapath in Tess4J. Currently, the library does not support using a datapath with Chinese characters, which limits its usability for users with non-English paths.
Example:
D:/测试路径/eng.traineddata
, I set dataPath toD:/测试路径
, it will printwhen using path
D:/test
, it works.Is it possible to modify the library's code to support non-English paths, such as setting a datapath with Chinese characters? This would greatly enhance the flexibility and usability of Tess4J for a wider range of users.
[The above content is built using gpt, the original text is from Chinese ]