UVicLibrary / Sufia-Head


OCR for uploads #2

Closed sephirothkod closed 8 years ago

sephirothkod commented 8 years ago

Add OCR software (tesseract?) for uploads.

sephirothkod commented 8 years ago

4410c97 adds this functionality for images only. Still working on PDFs; it looks like they will need to be separated into individual page images/objects, and then have the OCR applied to each individual page, with the result stored as a txt file derivative.
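For illustration only, the PDF approach described above (split into page images, then OCR each page to a txt file) might look roughly like the sketch below. This is not the code from 4410c97. It assumes the `pdftoppm` (Poppler) and `tesseract` command-line tools are installed, and the function names, the `page` file prefix, and the 300 DPI default are all hypothetical choices made for the example.

```python
import subprocess
from pathlib import Path


def split_pdf_command(pdf_path, out_prefix, dpi=300):
    """Build a pdftoppm command that renders each PDF page to a PNG.

    pdftoppm writes files named <out_prefix>-<page number>.png.
    The 300 DPI default is an assumption, not a value from the issue.
    """
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(out_prefix)]


def ocr_command(image_path):
    """Build a tesseract command for one page image.

    tesseract takes an output base name and appends '.txt' itself,
    so 'page-1.png' produces 'page-1.txt' next to the image.
    """
    image_path = Path(image_path)
    return ["tesseract", str(image_path), str(image_path.with_suffix(""))]


def ocr_pdf(pdf_path, work_dir, run=subprocess.run):
    """Split a PDF into page images and OCR each page to a txt derivative.

    `run` is injectable so the flow can be exercised without the
    external tools installed. Returns the expected txt file paths.
    """
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)
    prefix = work_dir / "page"  # hypothetical naming scheme

    # Step 1: one PNG per page.
    run(split_pdf_command(pdf_path, prefix), check=True)

    # Step 2: OCR every page image that step 1 produced.
    text_files = []
    for image in sorted(work_dir.glob("page-*.png")):
        run(ocr_command(image), check=True)
        text_files.append(image.with_suffix(".txt"))
    return text_files
```

Calling `ocr_pdf("upload.pdf", "/tmp/ocr_work")` would leave one `.txt` file per page in the working directory; attaching those files to the uploaded object as derivatives is left out, since that part depends on the application.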