Closed stockpilothq closed 1 year ago
Internal Error: Cannot handle URI 'https://project.blob.core.windows.net/media/invoices/pdf/51200617.pdf'.
Seems like you are trying to parse a PDF hosted on a webpage, this is not supported. You need to download the file locally (to disk of memory) before trying to parse it.
Works, thanks so much!
Hi everyone,
I've set up a project which uses pdf2image. I installed Poppler with Brew and it works locally (on my MacOS) like a charm.
Production on the other hand drives me crazy. I setup a Dockerfile and added the following command: RUN apt update && apt-get install -y poppler-utils
CLI outputs:
Everything seems to be installed correctly. But the moment I try to convert a pdf_from_path I retrieve the following error:
PDFPageCountError: Unable to get page count. Internal Error: Cannot handle URI 'https://project.blob.core.windows.net/media/invoices/pdf/51200617.pdf'.
Python code:
The answers on this error I find by search are all related to poppler_path and windows, which does not help. Hope someone can tell me with this issue.
Thanks in advance.