atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io

Camelot won't pull data, just the headers and page numbers #369

Closed ghost closed 4 years ago

ghost commented 4 years ago

So I am running this code:

`import camelot`
`from camelot.core import TableList`

`tables = camelot.read_pdf(PDF)`
`tables[0].to_csv('foo.csv')`

Which should be saving the contents of the file to that csv, but this is all that it saves:

"Inside" "Table: TableName Pages 2-8"

This is the same for the dataframe as well. The accuracy of this table was also 100% apparently. But there is no actual data. The next table in the same PDF has data but it is formatted pretty poorly:

"Table name","",""
"","12.60%",""
"","8.66% 8.10% 9.05% 7.35%",""
"","5.67% 4.65% 1.32%",""
"","-3.99%",""
"","-9.03%",""
"2009 2010 2011 2012 2013 2014 2015 2016 2017 2018

Am I doing something wrong?

anakin87 commented 4 years ago

Please post the PDF
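
In the meantime, here is a rough sketch of how the output could be checked before writing anything to CSV. It assumes two things worth ruling out: `camelot.read_pdf` only parses page 1 unless `pages` is passed, and the default `lattice` flavor expects ruled lines between cells, so a table without them may come out empty or merged. As far as I understand, a 100% accuracy only says that the text found inside the detected region was assigned to cells cleanly, not that the region held the data you wanted. The helper names `summarize` and `flag_suspect_tables` and the thresholds are made up for illustration; they are not part of Camelot.

```python
def flag_suspect_tables(reports, max_whitespace=50.0, min_rows=2):
    """Return indices of tables whose report suggests little real data.

    `reports` is a list of dicts with 'rows', 'cols' and 'whitespace'
    (a percentage). The thresholds are illustrative assumptions,
    not Camelot defaults.
    """
    suspects = []
    for i, report in enumerate(reports):
        too_small = report["rows"] < min_rows
        mostly_empty = report["whitespace"] > max_whitespace
        if too_small or mostly_empty:
            suspects.append(i)
    return suspects


def summarize(pdf_path, pages="all", flavor="lattice"):
    """Collect the shape and parsing report of every table Camelot finds."""
    import camelot  # imported lazily so the helper above runs without it

    tables = camelot.read_pdf(pdf_path, pages=pages, flavor=flavor)
    reports = []
    for table in tables:
        rows, cols = table.shape
        # parsing_report holds accuracy, whitespace, order and page
        report = dict(table.parsing_report)
        report.update(rows=rows, cols=cols)
        reports.append(report)
    return reports
```

Running `summarize(PDF, flavor="lattice")` and `summarize(PDF, flavor="stream")` and passing each result to `flag_suspect_tables` should show whether one flavor picks up the actual rows. It would still help to see the PDF itself.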

vinayak-mehta commented 4 years ago

Closing this due to inactivity.