Open DevinBayly opened 5 days ago
we have gotten much further with https://github.com/allenai/pdffigures2, so it's now possible to let this system do most of our extraction, it also provides caption information which wasn't attached to the publications db, but we could make a new table for it, or a field on each element within the figure table
consider closing this issue?
documentation is occuring in this item #17
look at the pdf image grabber that Carolina mentioned