hpanwar08 / detectron2

Detectron2 for Document Layout Analysis
Apache License 2.0
185 stars 63 forks source link

Extracting the text ? #1

Closed nodechef closed 4 years ago

nodechef commented 4 years ago
  1. Do you have any idea of how we can extract text from predicted BBOX ? Like Is there any inbuilt OCR ?
hpanwar08 commented 4 years ago

There is no inbuilt OCR in detectron2. You can use Tesseract4 for this or you can use tesserocr (https://github.com/sirfz/tesserocr) a python wrapper on tesseract.