jwilk-archive / ocrodjvu

OCR for DjVu
GNU General Public License v2.0
44 stars 19 forks source link

ValueError: need more than 0 values to unpack #18

Closed jwilk closed 8 years ago

jwilk commented 8 years ago

Issue reported by @jsbien:

The input file is practically identical with http://teksty.klf.uw.edu.pl/30/4/LindeIIGP%2B6i.djvu.

time ocrodjvu -D -t chars -e tesseract -l pol -p 1-3 --save-raw-ocr=hOCR4t --save-script tytularia.djvused LindeIIGP+6iN.djvu
Processing 'LindeIIGP+6iN.djvu':
- Page #1
tesseract: Tesseract Open Source OCR Engine v3.04.01 with Leptonica
tesseract: Page 1
tesseract: Tesseract Open Source OCR Engine v3.04.01 with Leptonica
tesseract: Page 1
- Page #2
tesseract: Tesseract Open Source OCR Engine v3.04.01 with Leptonica
tesseract: Page 1
tesseract: Tesseract Open Source OCR Engine v3.04.01 with Leptonica
tesseract: Page 1
Exception while processing page 2:
Traceback (most recent call last):
  File "/usr/share/ocrodjvu/lib/cli/ocrodjvu.py", line 429, in page_thread
    result = self.process_page(page)
  File "/usr/share/ocrodjvu/lib/cli/ocrodjvu.py", line 412, in process_page
    page_size=size
ValueError: need more than 0 values to unpack
Intermediate files were left in the '/tmp/ocrodjvu.oJcKMN' directory.

real    0m13.258s
user    0m12.312s
sys 0m0.180s
jwilk commented 8 years ago

Thanks, I reproduced the bug. This seems to happen with -t chars on empty pages.

jwilk commented 8 years ago

Fixed in 2c1164731715aa897bb08f63791427de1f61569f.

jwilk commented 8 years ago

Fixed in 0.9.2.