Open jqnatividad opened 9 years ago
Either with pdftables or Tabula
For extracting the fulltext and metadata from PDFs and other files we have developed ckanext-extractor. However, it only supports text, tabular data is not treated in a special way.
Either with pdftables or Tabula