atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io
Other
3.61k stars 349 forks source link

Can't get table name when extracting tables from pdf #378

Closed vslaser closed 4 years ago

vslaser commented 4 years ago

When I extract tables from pdf, I see that there are options to identify the page number from which the table is extracted and the table order for the page. However, the function does not extract the name of the table from pdf (the name usually is either at the top or bottom of a table).

anakin87 commented 4 years ago

Currently, Camelot can't extract table titles. (https://github.com/atlanhq/camelot/issues/247)

vslaser commented 4 years ago

Thanks

Regards, Vijay

On Fri, Oct 4, 2019 at 9:34 AM anakin87 notifications@github.com wrote:

Currently, Camelot can't extract table titles. (#247 https://github.com/atlanhq/camelot/issues/247)

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/atlanhq/camelot/issues/378?email_source=notifications&email_token=AEYEAR5EQDHNNMGSS7ZULSTQM352ZA5CNFSM4I5MJERKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAK542Q#issuecomment-538304106, or mute the thread https://github.com/notifications/unsubscribe-auth/AEYEAR6Y2J2NVFTKXHP3HPDQM352ZANCNFSM4I5MJERA .

vinayak-mehta commented 4 years ago

@vslaser You can also try specifying table areas that include table titles.

brifordwylie commented 3 years ago

Related to: https://github.com/atlanhq/camelot/issues/395