lokeshh closed this 4 years ago
Rerunning the code should skip already-OCRed segments, shouldn't it? Look at the latest code in the repo for reference. If you have an idea for how to improve it, send a pull request.
@vvasuki OK, I will try to fix it so that it skips the PDF parts that are already done.
@vvasuki My mistake. It indeed skips the parts which are already OCRed.
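For anyone reading later, here is a rough sketch of what "skip the parts which are already OCRed" can look like. This is not the code from the repo: `ocr_pending_segments`, the `ocr_segment` callback, and the convention of a `.txt` file next to each segment are all assumptions made for illustration.

```python
from pathlib import Path
from typing import Callable, Iterable, List


def ocr_pending_segments(
    segment_paths: Iterable[Path],
    ocr_segment: Callable[[Path], str],
) -> List[Path]:
    """OCR only the segments that have no saved output yet.

    `ocr_segment` is a hypothetical callback that takes a segment path and
    returns the recognized text. Returns the segments processed in this run.
    """
    processed = []
    for segment in segment_paths:
        out_path = segment.with_suffix(".txt")
        # A non-empty output file is treated as "already OCRed".
        if out_path.exists() and out_path.stat().st_size > 0:
            continue
        text = ocr_segment(segment)
        # Write to a temporary file first, then rename, so a crash mid-write
        # never leaves a partial file that looks like a finished one.
        tmp_path = out_path.with_name(out_path.name + ".tmp")
        tmp_path.write_text(text, encoding="utf-8")
        tmp_path.replace(out_path)
        processed.append(segment)
    return processed
```

With this pattern, a run that dies on page 1500 can simply be started again, and only the segments without output get sent to the OCR service.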
When OCRing a huge book (around 2000 pages), it fails with some exception every time, for example this one:
Sometimes the problem is with the network, and sometimes the Google server itself returns a 500 error.
Can we have a feature or a hack to resume where it last stopped?
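One possible workaround for the transient failures, sketched under assumptions: wrap each OCR request in a retry loop with exponential backoff, so a dropped connection or a one-off server error does not abort the whole book. `call_with_retries` is a hypothetical helper, not part of the repo, and the default exception types are placeholders. Which exception a 500 response actually surfaces as depends on the client library in use, so `retry_on` would need to be adjusted accordingly.

```python
import random
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def call_with_retries(
    func: Callable[[], T],
    retry_on: Tuple[Type[BaseException], ...] = (ConnectionError, TimeoutError),
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `func`, retrying on the given exception types.

    The wait doubles after each failed attempt (1s, 2s, 4s, ... by default)
    with some random jitter added. The last failure is re-raised.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return func()
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: let the caller see the error
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay / 2))
    raise ValueError("max_attempts must be at least 1")
```

Usage would be something like `call_with_retries(lambda: ocr_one_segment(path))`, where `ocr_one_segment` stands in for whatever function sends a single segment to the OCR service. Errors that are not listed in `retry_on` still propagate immediately, which is usually what you want for genuine bugs.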