fsingletonthorn / EffectSizeScraping

MIT License
1 stars 0 forks source link

build in OCR using the Tesseract OCR engine when PDF extraction fails to extract any text #18

Open fsingletonthorn opened 5 years ago

fsingletonthorn commented 5 years ago

Initial tests suggest that this will be too buggy to be useful - but may be better than nothing? This has been pushed down the timeline