pdf encoding wrong - Githubissues

atlanhq / camelot

Camelot: PDF Table Extraction for Humans

https://camelot-py.readthedocs.io

Other

3.65k stars 356 forks

pdf encoding wrong #359

Closed zamankul74 closed 4 years ago

zamankul74 commented 5 years ago

With some PDF files, reading from the PDF and converting to CSV gives output with the wrong encoding, and I can't resolve it by specifying UTF-8 encoding. I'm using the regular code:

camelot.read_pdf('test.pdf', pages='all', multiple_tables=True, encoding='utf-8')
...
table[0].to_csv('test.csv', encoding='utf-8')

The CSV file has strange encoding like:

","","","","","","","","","","","","","","","",""," 

What do I need to do in this case? Any suggestions?

anakin87 commented 5 years ago

Please attach the PDF so we can verify this issue.
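
In the meantime, one quick check may help narrow this down. The quoted output looks like a row of empty quoted cells rather than mis-encoded characters, which could mean that no text was extracted from the page at all (for example, a scanned or image-only PDF) and that the `encoding` argument is not the cause. The sketch below is only an assumption-laden diagnostic using the Python standard library: `blank_cell_ratio` is a hypothetical helper, not part of Camelot, and the sample string is made up to resemble the output above.

```python
import csv
import io


def blank_cell_ratio(csv_text: str) -> float:
    """Return the fraction of cells that are empty or whitespace-only.

    Hypothetical helper for diagnosis; not a Camelot function.
    """
    cells = [cell for row in csv.reader(io.StringIO(csv_text)) for cell in row]
    if not cells:
        # No cells at all is treated the same as "everything is blank".
        return 1.0
    blank = sum(1 for cell in cells if not cell.strip())
    return blank / len(cells)


# Made-up sample resembling the reported output: every cell is "".
sample = '"","","",""\n"","","",""\n'
print(f"{blank_cell_ratio(sample):.0%} of cells are blank")
```

If nearly every cell in the exported CSV is blank, the problem is more likely in text extraction than in the CSV encoding, and the PDF itself is what needs to be looked at.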

vinayak-mehta commented 4 years ago

Closing due to inactivity.