AiDAPT-A / VisArchPy

pipelines for the extraction and processing of visuals from PDFs
https://visarchpy.readthedocs.io
MIT License
How should metadata be organized in data pipelines? #13

Closed manuGil closed 1 year ago

manuGil commented 1 year ago

We should discuss which metadata format and structure works best for the research team. For example, should we use a tabular structure stored as CSV, or should we consider another format or structure?
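To make the trade-off concrete, here is a minimal sketch of the same metadata record written both as a CSV table and as JSON. The field names (`document`, `page`, `visual_id`, `bbox`, `captions`) and the helpers `to_csv` and `to_json` are hypothetical and only for illustration; they are not the actual VisArchPy schema or API.

```python
import csv
import io
import json

# Hypothetical metadata for one extracted visual (illustrative field names only).
record = {
    "document": "example.pdf",
    "page": 3,
    "visual_id": "img-0001",
    "bbox": [10.0, 20.0, 110.0, 220.0],
    "captions": ["Figure 1", "Site plan"],
}


def to_csv(records):
    """Write records as a flat CSV table.

    CSV cells hold scalars only, so list-valued fields are
    JSON-encoded into a single cell.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(records[0].keys()))
    writer.writeheader()
    for rec in records:
        row = {
            key: json.dumps(value) if isinstance(value, list) else value
            for key, value in rec.items()
        }
        writer.writerow(row)
    return buffer.getvalue()


def to_json(records):
    """Write records as JSON, keeping nested lists as they are."""
    return json.dumps(records, indent=2)


if __name__ == "__main__":
    print(to_csv([record]))
    print(to_json([record]))
```

In this sketch CSV stays easy to open in a spreadsheet, but nested fields such as `bbox` have to be encoded into a single cell and every value comes back as a string. JSON keeps the nesting and the types, at the cost of being less convenient to browse as a table.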

manuGil commented 1 year ago

complete