allenai / pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.
http://pdffigures2.allenai.org/
Apache License 2.0
611 stars 122 forks source link

Could you consider officially making `FigureExtractor.parseDocument` public #41

Closed reynoldsm88 closed 3 years ago

reynoldsm88 commented 3 years ago

There is a lot of useful information in the intermediate representation of the document, it would be nice to have access to that in the client facing API

chrisc36 commented 3 years ago

Close by #45