ckan / ckanext-pdfview

PDF viewer for CKAN
GNU Affero General Public License v3.0

Consider adding optional PDF data extraction #18

Open jqnatividad opened 9 years ago

jqnatividad commented 9 years ago

Either with pdftables or Tabula

torfsen commented 8 years ago

For extracting the full text and metadata from PDFs and other files, we have developed ckanext-extractor. However, it only supports text; tabular data is not treated in a special way.