Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
6.1k
stars
625
forks
source link
TypeError: unsupported operand type(s) for %: 'NoneType' and 'int' when trying to access PDF page objects #827
Closed
thefirebanks closed 1 year ago
Describe the bug
Running into this issue when opening a specific PDF file:
Code to reproduce the problem
PDF file
sample_file.pdf
If you need to redact text in a sensitive PDF, you can run it through JoshData/pdf-redactor.
Expected behavior
I should be able to access the page objects from the PDF. I tried opening it with PyMuPDF and it works.
Actual behavior
Got the error message.
Environment
Additional context
Add any other context/notes about the problem here.