ckan / datapusher

A standalone web service that pushes data files from a CKAN site's resources into its DataStore
GNU Affero General Public License v3.0

To push PDF to Datastore. #34

Closed abhims closed 7 years ago

abhims commented 10 years ago

I think this is required. I saw exception traces in the log caused by its absence.

coveralls commented 10 years ago

Coverage remained the same when pulling 8d973065239d5ac966fbee0648b8458958c9575f on abhims:patch-1 into 91f68b7f962a2f43bf60fdb97e62d37172aca79e on ckan:master.

amercader commented 7 years ago

Sorry, DataPusher only deals with tabular data (CSV, Excel, etc.).

RichFrazier commented 6 years ago

Having this problem on our instance https://data.amerigeoss.org

Error:
  File "/usr/lib/ckan/datapusher/lib/python2.7/site-packages/apscheduler/scheduler.py", line 512, in _run_job
    retval = job.func(*job.args, **job.kwargs)
  File "/usr/lib/ckan/datapusher/src/datapusher/datapusher/jobs.py", line 352, in push_to_datastore
    table_set = messytables.any_tableset(f, mimetype=ct, extension=ct)
  File "/usr/lib/ckan/datapusher/lib/python2.7/site-packages/messytables/any.py", line 137, in any_tableset
    return parsers[attempt](fileobj, **kw)
  File "/usr/lib/ckan/datapusher/lib/python2.7/site-packages/messytables/pdf.py", line 50, in __init__
    raise ImportError("pdftables is not installed")
ImportError('pdftables is not installed',)
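
The traceback shows a PDF resource being handed to the table parser, which fails because pdftables is missing. One possible workaround is to skip non-tabular resources before they are submitted. The following is only a minimal sketch, not DataPusher's actual code: the names TABULAR_EXTENSIONS, is_tabular and should_push are hypothetical, and the list of formats is an assumption that would need adjusting per site.

```python
# Hypothetical allow-list of tabular formats; this is an assumption,
# not DataPusher's real configuration.
TABULAR_EXTENSIONS = {"csv", "tsv", "xls", "xlsx", "ods"}


def is_tabular(resource_format):
    """Return True if the given format string names a tabular file type."""
    if not resource_format:
        return False
    # Normalise values such as " .CSV " to "csv" before the lookup.
    normalised = resource_format.strip().lower().lstrip(".")
    return normalised in TABULAR_EXTENSIONS


def should_push(resource):
    """Decide whether a resource dict should be sent to a table parser.

    `resource` is assumed to be a dict with an optional "format" key,
    similar in shape to a CKAN resource.
    """
    return is_tabular(resource.get("format"))
```

With a check like this, a resource such as {"format": "PDF"} would be skipped instead of reaching the PDF parser and raising ImportError.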