Closed dstoekl closed 3 years ago
@dstoekl Thank you for pointing out the issue, I have fixed the bug and released a newer version of the package. Also, I have created this sample notebook on colab where you can test out the library, for using other language you will have to install language pack from tessaract
Notebook: https://colab.research.google.com/drive/1a4lCsxedHGIFgpoyHWYF5v4fLVxKmZpQ?usp=sharing
installing language packs: https://ocrmypdf.readthedocs.io/en/latest/languages.html
Great! Very helpful! Many thanks!
Hi there. on colab seizing: from multilingual_pdf2text.pdf2text import PDF2Text gives:
TypeError Traceback (most recent call last)