Open tshrinivasan opened 4 years ago
For the web version of Tesseract please explore this extension https://wikisource.org/w/index.php?title=User:Putnik/TesseractOCR.js&action=raw&ctype=text/javascript
using this extension ocr is done in wikisource website using tesseract.
I have used 2 GUI front-end for tesseract on Windows. These are vietocr and gimagereader.
Create a Web/Windows version of Google OCR/Tesseract
Explore this https://github.com/kaniyamfoundation/pdf2text
and make it as GUI version for windows/linux/mac (with GTK or QT or TK) and a web version like http://tts.kaniyaam.com
I started implementing a GUI using a Python Module called PySimpleGUI (https://pysimplegui.readthedocs.io/en/latest/)
Ghostscript(to convert PDF to JPG) and PyPDF2 were chosen for this implementation. (pdfseperate needs to be installed as a Desktop application in windows So that we can access its command in Windows CMD. So, I chose PyPDF2 to split PDF into single pages its a Python module so we can install it quickly. ) The main concern in this implementation is we need to check the performance/loading time of the final application
I will keep updating the implementation process in this thread.
Thanks. Share the repo.
https://github.com/Parathantl/tesseract_gui/tree/master/PySimpleGui
I did the basic Implementation. Which takes the folder path of PDF from users through GUI. Then, the respective folders for single JPG files, Text files to be saved and file path of Ghostscript.
I will further work on this to shape up the app. I need to test the performance of the execution.( I have some issues with my Laptop performance.)
Finally the Windows GUI version is released.
Here a linux version to OCR a given PDF file https://gist.github.com/tshrinivasan/0aaf78e5808ee29490928614882cded0
Here is a windows GUI version https://github.com/Parathantl/tesseract_gui/releases
Demo video in tamil - https://www.youtube.com/watch?v=363DGNL-rUw
Detailed notes are here https://goinggnu.wordpress.com/2020/05/23/tesseract-ocr-gui-for-windows/
Thanks to @Parathantl for the windows version.
Create a Web/Windows version of Google OCR/Tesseract
Explore this https://github.com/kaniyamfoundation/pdf2text
and make it as GUI version for windows/linux/mac (with GTK or QT or TK) and a web version like http://tts.kaniyaam.com