jalan / pdftotext

Simple PDF text extraction
MIT License
870 stars 99 forks source link

Getting error Invalid ToUnicode Cmap #118

Closed de3 closed 11 months ago

de3 commented 1 year ago

I get this error for some pdf documents

Got stderr: Nov 07, 2023 12:09:01 PM org.apache.pdfbox.pdmodel.font.PDFont loadUnicodeCmap
WARNING: Invalid ToUnicode CMap in font KQEMOW+DejaVuSerifCondensed-Bold

I can open pdf using browser or pdf reader, but can't open it using pdftotext

jalan commented 1 year ago

That looks like an error from a different project. Are you even using this project? If so, attach the PDF and we can have a look.

jalan commented 11 months ago

Closing due to lack of response