atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io

Multi-page PDF is not parsed fully: tables on the second page are not returned #492

Open cheneygan opened 1 year ago

cheneygan commented 1 year ago

A PDF file has two pages: the first page has 3 tables and the second has 2 tables. But when I call camelot.read_pdf with the parameter pages='all', the result contains only 3 tables, not 5.

import camelot
tables = camelot.read_pdf('a22.pdf', pages='all')
tables

Why are only 3 tables returned? Thanks. The source PDF is here: [a22.pdf](https://github.com/atlanhq/camelot/files/10253806/a22.pdf)
shivambaldha1 commented 1 year ago

Yes, sometimes camelot does not extract all the tables. I faced the same issue.
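
One way to narrow this down is to check which pages the returned tables come from, and whether the other parsing flavor finds more. The sketch below is only a diagnostic, not a confirmed fix for a22.pdf. It relies on the `page` attribute of each extracted table and on the `flavor` argument of `camelot.read_pdf` (`lattice` is the default and looks for ruling lines; `stream` uses whitespace). The helper names `tables_per_page` and `diagnose` are hypothetical, made up for this example.

```python
from collections import Counter


def tables_per_page(tables):
    """Count extracted tables per page; each item must expose a `.page` attribute."""
    counts = Counter(int(t.page) for t in tables)
    return dict(sorted(counts.items()))


def diagnose(pdf_path, pages="all"):
    """Run both camelot flavors and report how many tables each finds per page."""
    # Imported here so tables_per_page stays usable without camelot installed.
    import camelot

    report = {}
    for flavor in ("lattice", "stream"):
        tables = camelot.read_pdf(pdf_path, pages=pages, flavor=flavor)
        report[flavor] = tables_per_page(tables)
    return report
```

Calling `diagnose('a22.pdf')` returns one dictionary per flavor, mapping page number to table count. If page 2 is missing under `lattice` but present under `stream`, the tables on that page probably have no drawn cell borders, and passing `flavor='stream'` for that page may be worth trying. If page 2 is missing under both, the problem lies elsewhere.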