tshrinivasan / OCR4wikisource

OCR for WikiSource using Google Drive OCR
GNU General Public License v2.0
33 stars 24 forks source link

Make a web version of OCR4WikiSource #89

Open tshrinivasan opened 7 years ago

tshrinivasan commented 7 years ago

The OCR4Wikisource is a python script that runs only on GNU/Linux and in commandline. Many new users are feeling tough to setup and execute this.

A web version of the same tool is required, so that any new user can use it easily via browser.

Requirements

  1. user login with wiki credentials,
  2. give a URL of PDF file or upload PDF file.
  3. select wikisource language
  4. Give email address for notification
  5. These details are stored in a queue.
  6. OCR4Wikisource should read the queue, OCR it and paste in wikisource
  7. Once done, notify the user.

Can anyone volunteer for creating a web version?

samwilson commented 7 years ago

Can you elaborate on step 6 "OCR4Wikisource should read the queue, OCR it and paste in wikisource" — does this mean the tool itself would add the text to the relevant page on Wikisource? Or the user would copy and paste the text there?

What differences in workflow or features are there with respect to the proofreadpage system of proofreading a page at a time within wikisource?

I'm wondering if the ws-google-ocr tool could be modified to selectively either use the Vision API or the Drive system of OCR.

bodhisattwawiki commented 7 years ago

1) Yes, the script itself adds the texts to relevant pages. Users dont have to do it manually.

2) This script also does OCR one book at a time in contrast to the existing OCR (Phe or ws-google-ocr) system, where single page is OCRed at a time.

bodhisattwawiki commented 7 years ago

@samwilson , we have a test file for Bengali Wikisource. Please feel free to test with it using OCR4Wikisource script.