vk2diy opened this issue 1 month ago (status: Open)
What version of pdfminer.six are you using? I can't reproduce this with either Python 3.11 or 3.12 and pdfminer.six v20240706.
Looks old.
./lib/python3.12/site-packages/pdfminer-20191125.dist-info
Unsure why it would be old; I used pip to install it. I'm not really a Python person.
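One thing worth checking: the dist-info directory above is named pdfminer-20191125, which suggests the installed distribution may be the older `pdfminer` package rather than `pdfminer.six`. A minimal sketch for confirming which of the two is present in the active environment is below. It assumes the two distribution names are `pdfminer` and `pdfminer.six`; the helper name `installed_versions` is made up for illustration.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(names=("pdfminer", "pdfminer.six")):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions().items():
        print(f"{name}: {ver or 'not installed'}")
```

Run it with the same interpreter that owns the site-packages directory shown above, otherwise it reports on a different environment.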
Description
Crash on non-ASCII input:
UnicodeDecodeError: 'ascii' codec can't decode byte 0x85 in position 0: ordinal not in range(128)
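For context, this message is what Python raises whenever bytes outside the 0-127 range are decoded with the ASCII codec. The sketch below reproduces the same error text in isolation. It is not taken from pdfminer, and where the decode actually happens inside pdf2txt.py is not known from this report; the helper name `try_ascii_decode` is hypothetical.

```python
def try_ascii_decode(data: bytes) -> str:
    """Decode bytes as ASCII, returning the error text instead of raising."""
    try:
        return data.decode("ascii")
    except UnicodeDecodeError as exc:
        # Byte values above 0x7F are not valid ASCII.
        return f"UnicodeDecodeError: {exc}"


if __name__ == "__main__":
    print(try_ascii_decode(b"\x85"))
```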
Steps to reproduce the bug
To make it easier, this will download mc3362.pdf.
wget https://github.com/user-attachments/files/16489263/mc3362.pdf && pdf2txt.py mc3362.pdf
Error produced