CypherousSkies / reading-for-listeners

A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
GNU Affero General Public License v3.0
23 stars 3 forks source link

ocrmypdf Integration #3

Closed CypherousSkies closed 2 years ago

CypherousSkies commented 3 years ago

Todo:

CypherousSkies commented 3 years ago

Here's a post that should help fix bad ocr https://medium.com/states-title/using-nlp-bert-to-improve-ocr-accuracy-385c98ae174c

CypherousSkies commented 2 years ago

splitting remove page numbers to #9 as its going to take substantial work.

CypherousSkies commented 2 years ago

functionally ocrmypdf is implemented now, modulo opencv tasks (#9) which is beyond my scope for short term work