maiaPhilippe / pdf-to-text

PDF OCR in pure JavaScript using the tesseract.js API

PDF-TO-TEXT

PDF-to-Text is a mobile-ready OCR tool written in pure JavaScript on top of the tesseract.js API. It converts image-based text in PDF files (for example, scanned pages) into plain text.

Tech

PDF-to-Text uses a number of open source projects to work properly:

Installation

PDF-to-Text requires Node.js v4+ or any other server environment that can serve static files.

Install http-server globally, then start it from the project directory:

$ npm install http-server -g
$ cd pdf-to-text-master
$ http-server
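
Since any server environment that can serve the project files should work, here is a minimal sketch of a static file server built only on Node's built-in `http`, `fs`, and `path` modules, as a possible stand-in for `http-server`. It is not part of this project: the names `createStaticServer`, `resolveSafePath`, and `contentTypeFor`, the MIME table, and the `index.html` default are all assumptions made for illustration.

```javascript
'use strict';
const http = require('http');
const fs = require('fs');
const path = require('path');

// Small, deliberately incomplete MIME table (illustrative, not exhaustive).
const MIME_TYPES = {
  '.html': 'text/html; charset=utf-8',
  '.js': 'application/javascript; charset=utf-8',
  '.css': 'text/css; charset=utf-8',
  '.json': 'application/json; charset=utf-8',
  '.pdf': 'application/pdf',
  '.png': 'image/png',
  '.jpg': 'image/jpeg',
};

// Pick a Content-Type from the file extension, with a generic fallback.
function contentTypeFor(filePath) {
  const ext = path.extname(filePath).toLowerCase();
  return MIME_TYPES[ext] || 'application/octet-stream';
}

// Map a request URL to a file under `root`.
// Returns null when the URL is malformed or would escape `root`.
function resolveSafePath(root, urlPath) {
  let decoded;
  try {
    decoded = decodeURIComponent(urlPath.split('?')[0]);
  } catch (err) {
    return null; // malformed percent-encoding
  }
  if (decoded.indexOf('\0') !== -1) return null;
  // Assumption: directories are served through an index.html file.
  if (decoded.endsWith('/')) decoded += 'index.html';
  const rootAbs = path.resolve(root);
  const target = path.join(rootAbs, decoded);
  if (target !== rootAbs && !target.startsWith(rootAbs + path.sep)) {
    return null; // path traversal attempt such as /../secret.txt
  }
  return target;
}

// Build (but do not start) an HTTP server that serves files from `root`.
function createStaticServer(root) {
  return http.createServer((req, res) => {
    const filePath = resolveSafePath(root, req.url);
    if (filePath === null) {
      res.writeHead(403, { 'Content-Type': 'text/plain' });
      res.end('Forbidden');
      return;
    }
    fs.readFile(filePath, (err, data) => {
      if (err) {
        res.writeHead(404, { 'Content-Type': 'text/plain' });
        res.end('Not found');
        return;
      }
      res.writeHead(200, { 'Content-Type': contentTypeFor(filePath) });
      res.end(data);
    });
  });
}

module.exports = { createStaticServer, resolveSafePath, contentTypeFor };
```

To try it, save the block under a name of your choice in the project directory and call `createStaticServer('.').listen(8080)`; port 8080 is chosen here to match the default of `http-server`. The sketch has no caching, directory listings, or range requests, so `http-server` remains the more complete option.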