Sicos1977 / TesseractOCR

A .NET library to work with Google's Tesseract

Add OCR Text layer to existing PDF #54

Closed: DoCode closed this issue 8 months ago

DoCode commented 8 months ago

Is it possible to add an OCR text layer to an existing PDF file?

The result should be a searchable PDF.

Sicos1977 commented 8 months ago

Yes, that is possible by default. Look in the Tesseract manual for how to do it. Tesseract supports several output formats, such as hOCR, PDF, HTML, etc.
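For anyone landing here later, a rough sketch of the PDF output route mentioned above. It drives the Tesseract command-line tool from Python; it does not use this .NET library's API. The function names `build_tesseract_pdf_command` and `ocr_image_to_pdf` are made up for illustration. It assumes the `tesseract` executable is on the PATH. Tesseract takes images as input, so the pages of an existing PDF would have to be rendered to images with a separate tool first. The `textonly_pdf` option assumes Tesseract 4 or newer.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_pdf_command(image_path, output_base, lang="eng", text_only=False):
    """Build the argv list for a Tesseract CLI run that writes <output_base>.pdf.

    Hypothetical helper: it only assembles the arguments and runs nothing.
    """
    cmd = ["tesseract", str(image_path), str(output_base), "-l", lang]
    if text_only:
        # Assumption: Tesseract 4+. Asks for a PDF that contains only the
        # invisible text layer, without the page image.
        cmd += ["-c", "textonly_pdf=1"]
    # The trailing "pdf" config name selects the PDF renderer.
    cmd.append("pdf")
    return cmd


def ocr_image_to_pdf(image_path, output_base, lang="eng", text_only=False):
    """Run Tesseract on one page image and return the path of the PDF it writes."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    cmd = build_tesseract_pdf_command(image_path, output_base, lang, text_only)
    subprocess.run(cmd, check=True)
    # Tesseract appends the ".pdf" extension to the output base itself.
    return Path(f"{output_base}.pdf")
```

Calling `ocr_image_to_pdf("page-1.png", "page-1")` should leave a searchable `page-1.pdf` next to the image. With `text_only=True` the output holds just the text layer, which a separate PDF tool could then overlay on the original page. That merge step is outside what this sketch covers.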