amebalabs / TRex

Copy any text on your screen, stop retyping.
https://trex.ameba.co
MIT License
1.24k stars 40 forks source link

Tesseract integration for more languages #33

Open pramjan opened 3 months ago

pramjan commented 3 months ago

Hi,

Hope you are doing well!

Is there a way you can harness the tesseract (which is already installed on my system - it just needs to run using a shortcut or be available via the menubar like your app) command line application, which can provide OCR output for many other languages?

Thank you for your consideration!

Best, Pramod

melonamin commented 3 months ago

It is quite a sizable feature to add, will require a lot of work.

What languages are you looking for specifically?

pramjan commented 3 months ago

Hmm, I didn't realize it would be a lot of work. I was thinking maybe run the tesseract command with the captured image (in the background), and display the result in your app like usual. I'm from India, and looking for OCR for Telugu, Hindi (this is just a parameter to pass to tesseract) This command for the Telugu language would be like:

tesseract input_image.png output_file -l tel
melonamin commented 3 months ago

Yeah, I understand, but I want TRex to be easy to use and work out of the box. I can't ask user to install additional tool on aside.

So to do it properly I need to take tessaract library and properly integrate it. Let me think about it a bit, there are a lot of potential users in India 😄