Open swoltron opened 5 years ago
Could you please check the output of "pip freeze"? If both "pdfminer" and "pdfminer.six" are installed, run "pip uninstall pdfminer" and "pip uninstall pdfminer.six". Once both are removed, run "pip install pdfminer.six".
Sorry... I am on a Python 3.7.7 env. I cannot give you any solution.
I'm having the same problem, but only with files that were saved through "Microsoft Print to PDF". I'm trying to extract text from a table that has been converted to PDF. If I just save as PDF it works; if I print to PDF (through this "printer"), it doesn't. Hope this helps to solve the issue.
Hi, I am facing this error too, but unfortunately I can't modify the PDF file, so I need to handle this programmatically. Could you guide me if you have resolved it? My metadata has this as a field value: {'q':
I am also facing this error on some PDF files. I was able to duplicate the "fix" of using a PDF created using "save as" instead of "print to PDF", but like reema-dass26, I don't always have the ability to do that. I can't believe we are the only two that have this problem....
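For anyone who can't re-save the PDF and is reading metadata from their own Python code: a minimal sketch of one possible workaround, assuming the failure comes from calling ".decode()" on a metadata value that is an indirect reference rather than bytes. The helper "to_text" and the stand-in class "FakeRef" are hypothetical names for illustration. The helper duck-types on a "resolve()" method, which pdfminer.six's PDFObjRef provides; pdfminer.six also ships "resolve1" in "pdfminer.pdftypes" for the same purpose. This will not help if the exception is raised inside pdf2txt.py itself.

```python
def to_text(value, encoding="utf-8", max_depth=10):
    """Best-effort conversion of a PDF metadata value to str.

    Assumption: reference-like objects expose a resolve() method that
    returns the referenced object (as PDFObjRef does in pdfminer.six).
    """
    depth = 0
    # Follow chains of references, with a depth cap to avoid looping forever.
    while hasattr(value, "resolve") and depth < max_depth:
        value = value.resolve()
        depth += 1
    if value is None:
        return ""
    if isinstance(value, bytes):
        # Replace undecodable bytes instead of raising.
        return value.decode(encoding, errors="replace")
    return str(value)


class FakeRef:
    """Hypothetical stand-in for a reference object, for demonstration only."""

    def __init__(self, target):
        self.target = target

    def resolve(self):
        return self.target


if __name__ == "__main__":
    print(to_text(b"Plain title"))
    print(to_text(FakeRef(b"Referenced title")))
```

With real documents, the idea would be to pass each value from the document's info dictionary through "to_text" instead of calling ".decode()" on it directly.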
I am using pdfminer's pdf2txt.py to extract text from different PDFs. The algorithm works very well in a lot of scenarios, but I am getting this error and I'm not sure what I can do to get pdfminer to work.
AttributeError: 'PDFObjRef' object has no attribute 'decode'
I have run this same command on other documents and it is only recently that I started seeing this.
I am simply running this from the command line:
This is the complete output from pdf2txt.py:
Any help is appreciated!