Create a Web/Windows version of Google OCR/Tesseract

KaniyamFoundation / ProjectIdeas

A Place to write down the project ideas and to plan them

39 stars 3 forks source link

Create a Web/Windows version of Google OCR/Tesseract #80

Open tshrinivasan opened 4 years ago

tshrinivasan commented 4 years ago

Create a Web/Windows version of Google OCR/Tesseract

Explore this https://github.com/kaniyamfoundation/pdf2text

and make it as GUI version for windows/linux/mac (with GTK or QT or TK) and a web version like http://tts.kaniyaam.com

balajijagadesh commented 4 years ago

For the web version of Tesseract please explore this extension https://wikisource.org/w/index.php?title=User:Putnik/TesseractOCR.js&action=raw&ctype=text/javascript

using this extension ocr is done in wikisource website using tesseract.

Shreeshrii commented 4 years ago

I have used 2 GUI front-end for tesseract on Windows. These are vietocr and gimagereader.

sathia27 commented 4 years ago

Create a Web/Windows version of Google OCR/Tesseract

Explore this https://github.com/kaniyamfoundation/pdf2text

and make it as GUI version for windows/linux/mac (with GTK or QT or TK) and a web version like http://tts.kaniyaam.com

http://tts.kaniyam.com/

Shreeshrii commented 4 years ago

Also see https://www.kielipankki.fi/cgi-bin/ocr.py

Parathantl commented 4 years ago

I started implementing a GUI using a Python Module called PySimpleGUI (https://pysimplegui.readthedocs.io/en/latest/)

Ghostscript(to convert PDF to JPG) and PyPDF2 were chosen for this implementation. (pdfseperate needs to be installed as a Desktop application in windows So that we can access its command in Windows CMD. So, I chose PyPDF2 to split PDF into single pages its a Python module so we can install it quickly. ) The main concern in this implementation is we need to check the performance/loading time of the final application

I will keep updating the implementation process in this thread.

tshrinivasan commented 4 years ago

Thanks. Share the repo.

Parathantl commented 4 years ago

https://github.com/Parathantl/tesseract_gui/tree/master/PySimpleGui

I did the basic Implementation. Which takes the folder path of PDF from users through GUI. Then, the respective folders for single JPG files, Text files to be saved and file path of Ghostscript.

Capture1

I will further work on this to shape up the app. I need to test the performance of the execution.( I have some issues with my Laptop performance.)

tshrinivasan commented 4 years ago

Finally the Windows GUI version is released.

Here a linux version to OCR a given PDF file https://gist.github.com/tshrinivasan/0aaf78e5808ee29490928614882cded0

Here is a windows GUI version https://github.com/Parathantl/tesseract_gui/releases

Demo video in tamil - https://www.youtube.com/watch?v=363DGNL-rUw

Detailed notes are here https://goinggnu.wordpress.com/2020/05/23/tesseract-ocr-gui-for-windows/

Thanks to @Parathantl for the windows version.