Open matt-laird opened 11 months ago
I have also faced this issue for a long time. Is just a string trim fine, or is there something in Tesseract that I should configure? Any ideas?
I had a brief look. It does seem to be an artifact of Tesseract's process. Maybe give the Tesseract FAQ a read and see if the different options help at all. Unfortunately, I can't test these myself right now.
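One option that might be worth trying is Tesseract's `page_separator` config variable, which as far as I know defaults to a form feed in Tesseract 4.1 and later. This is only a sketch and untested against TextSnatcher: the helper name `build_tesseract_command` and the file names are made up for illustration, and it only builds the command line rather than running it.

```python
def build_tesseract_command(image_path, output_base):
    """Build a tesseract CLI invocation with an empty page separator.

    Assumption: the installed Tesseract supports the `page_separator`
    config variable (believed to exist since 4.1). Setting it to an
    empty string should stop the form feed from being appended.
    """
    return [
        "tesseract",
        image_path,
        output_base,
        "-c",
        "page_separator=",  # empty value: no separator written after the page
    ]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    print(" ".join(build_tesseract_command("capture.png", "result")))
```

The list can be passed to `subprocess.run` if Tesseract is installed. If the variable is not recognized by your version, trimming the output is the fallback.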
I think we can trim the string for now, I guess. Thanks, now I also have the exact Unicode character to find and remove.
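For reference, the trim could look roughly like this. It is a sketch in Python rather than the project's own code, and `clean_ocr_text` is a hypothetical name. It only strips the trailing form feed (U+000C) and other trailing whitespace, so any form feed in the middle of the text is left alone.

```python
FORM_FEED = "\u000c"  # the invisible character reported in this issue


def clean_ocr_text(text: str) -> str:
    """Strip trailing whitespace, including a trailing form feed, from OCR output."""
    # rstrip with an explicit character set removes any run of these
    # characters from the end of the string and nothing else.
    return text.rstrip(" \t\r\n" + FORM_FEED)


if __name__ == "__main__":
    raw = "Recognized text\n" + FORM_FEED
    print(repr(clean_ocr_text(raw)))
```

Stripping only the end keeps the behavior predictable if the text is ever produced from more than one page.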
There seems to be a U+000c invisible Unicode character at the end of all generated text. This causes problems in some applications when pasting resulting text. See below example, problem on line 2:

![image](https://github.com/RajSolai/TextSnatcher/assets/14907718/073861fd-0acf-4003-a2bd-d9ec96bfbcc5)