What are the `page_index` and `filename` arguments in ProcessPage() ?

sirfz / tesserocr

A Python wrapper for the tesseract-ocr API

MIT License

2.02k stars 254 forks source link

I'm trying to convert a PIL Image into a searchable PDF. For image files, ProcessPages(outbase, image_filename) works perfectly. For PIL Image, it seems ProcessPage() is the equivalent method. But there are two additional arguments. I tried setting:

page_index = 0
filename = "test"

It generated a corrupt PDF file. Can anyone please help me on proper usage of ProcessPage() method?

Some info that might be helpful:

Tesseract version = 4
Tesserocr version = 2.4.0
Python version = 3.6.7

sirfz / tesserocr

What are the `page_index` and `filename` arguments in ProcessPage() ? #167