adamkucharski / scrapR

Extract raw underlying data from PDF figures
MIT License
54 stars 7 forks source link

Automatically extract values #2

Open adamkucharski opened 1 year ago

adamkucharski commented 1 year ago

If swap from list storage to a data.frame with enumerated elements in load_PDF_data, could explore automated identification of geometry (e.g. axes, labels etc.) and hence potential for automatic extraction.

adamkucharski commented 1 year ago

Can also explore potential for combining with functionality in pdftools to also extract text, alongside current vector extract using grImport.