galkahana / pdf-text-extraction

cli for extracting text from PDF files (and maybe possibly tables)
Apache License 2.0
74 stars 19 forks source link

not working for Chinese PDF #6

Closed z16166 closed 1 year ago

z16166 commented 2 years ago

not working for PDF with Chinese characters.

Output is unreadable and doesn't contain all the characters of that pdf.

galkahana commented 1 year ago

You might wanna provide a sample file.