Closed Witiko closed 4 years ago
See issue #175.
Prediction of a page
Currently only OCR on lines is supported. Modules to segment pages into lines will be available soon. In the meantime you should use the scripts provided by OCRopus.
@ChWick, about the 'soon' part, it was stated in commit e3e6099a7045, in April 2018. Maybe you should remove it.
Consider the following image of the 2018 JLCL Calamari paper abstract:
Running `calamari-predict` with the pretrained antiqua model on this image produces an empty prediction:

Is this expected behavior, or is there some mistake on my part? If there is no mistake on my part, could this be because the default models have been trained on an unrepresentative dataset?
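If the empty prediction comes from feeding a whole page to a model that only handles single lines (as the README excerpt above suggests), one rough workaround would be to cut the page into line images first. The following is only a minimal sketch of a horizontal projection profile, not part of Calamari or OCRopus: the function name `split_into_lines` and its parameters are hypothetical, and it assumes a clean, deskewed, single-column grayscale page with dark text on a light background. The OCRopus segmentation scripts are likely far more robust.

```python
def split_into_lines(page, ink_threshold=0.5, min_height=2):
    """Return (top, bottom) row ranges of candidate text lines.

    page: list of pixel rows, each a list of floats in [0, 1],
          where low values are dark (ink). Hypothetical helper,
          assumed input format.
    """
    # A pixel row counts as "ink" if any pixel is darker than the threshold.
    has_ink = [any(px < ink_threshold for px in row) for row in page]

    lines = []
    start = None  # first row of the line currently being tracked
    for y, flag in enumerate(has_ink):
        if flag and start is None:
            start = y                      # a new line begins
        elif not flag and start is not None:
            if y - start >= min_height:    # drop specks thinner than min_height
                lines.append((start, y))
            start = None

    # Close a line that runs to the bottom edge of the page.
    if start is not None and len(has_ink) - start >= min_height:
        lines.append((start, len(has_ink)))
    return lines
```

Each returned `(top, bottom)` pair could then be used to crop `page[top:bottom]` and save it as a separate line image for prediction.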