Closed Readix1 closed 2 months ago
Hi @Readix1, and thanks for sharing this example. Based on the (very helpful) stack trace you shared, the error you encountered appears to stem from pdfminer.six, pdfplumber's main dependency. My guess is that it relates to the way PyPDF2 is creating the split files. Unfortunately, this means that there isn't much pdfplumber can do to resolve the issue, so I'm closing it for now.
Hi, thank you for your answer. I understand. I'll need to find another way, then.
Describe the bug
To work around long PDFs taking too long to open, I use the PyPDF2 library to keep only the pages containing the tables I need. I then open the resulting PDF, which contains only a few pages, so it opens faster. Unfortunately, in some cases this error occurs.
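For readers following along, here is a minimal sketch of the kind of workflow described above. It is not the reporter's actual code, which was not included in the thread. It assumes PyPDF2 2.x or later (`PdfReader`, `PdfWriter`, `add_page`) and pdfplumber's `open` and `extract_tables`. The function names `select_page_indices`, `split_pdf`, and `extract_tables_from` are hypothetical, chosen only for illustration.

```python
from typing import Iterable, List


def select_page_indices(wanted_pages: Iterable[int], total_pages: int) -> List[int]:
    """Turn 1-based page numbers into sorted, de-duplicated 0-based indices.

    Page numbers outside 1..total_pages are silently dropped.
    (Hypothetical helper, not part of PyPDF2 or pdfplumber.)
    """
    return sorted({p - 1 for p in wanted_pages if 1 <= p <= total_pages})


def split_pdf(src_path: str, dst_path: str, wanted_pages: Iterable[int]) -> None:
    """Write a new PDF containing only the requested pages of src_path."""
    # Imported lazily so the helper above works without PyPDF2 installed.
    # Assumption: PyPDF2 >= 2.0, where these names are available.
    from PyPDF2 import PdfReader, PdfWriter

    reader = PdfReader(src_path)
    writer = PdfWriter()
    for idx in select_page_indices(wanted_pages, len(reader.pages)):
        writer.add_page(reader.pages[idx])
    with open(dst_path, "wb") as fh:
        writer.write(fh)


def extract_tables_from(pdf_path: str) -> list:
    """Open the (smaller) PDF with pdfplumber and collect tables per page."""
    import pdfplumber

    with pdfplumber.open(pdf_path) as pdf:
        return [page.extract_tables() for page in pdf.pages]
```

In this sketch the failure reported here would surface in the last step, when pdfplumber (via pdfminer.six) parses the file that PyPDF2 wrote.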
Have you tried repairing the PDF?
I can't install Ghostscript due to rights issues, and I would rather not have to install a second application alongside mine just to run it.
Code to reproduce the problem
PDF file
I can't share my PDFs, and they are too big to run through pdf-redactor (around 1000 pages).
Expected behavior
Extract without any problem
Actual behavior
Environment