sensiblecodeio / scraperwiki-python

ScraperWiki Python library for scraping and saving data
https://scraperwiki.com
BSD 2-Clause "Simplified" License

Dependency on pdftohtml #91

Closed by hassaananjum 8 years ago

hassaananjum commented 8 years ago

The pdftoxml method depends on the pdftohtml utility. If the utility is missing, no error is reported, but calling the method on PDF file data returns no XML either. To fix the problem, install pdftohtml, which is part of poppler-utils: run sudo apt-get install poppler-utils. The devs need to update the README to list this dependency.
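Until the dependency is documented, a script can check for the utility itself before calling pdftoxml. The following is only a minimal sketch, assuming Python 3.3 or later (for shutil.which); require_tool and its package_hint parameter are hypothetical names made up for this example and are not part of the scraperwiki library.

```python
import shutil


def require_tool(name, package_hint=None):
    """Return the full path to `name`, or raise RuntimeError if it is not on PATH.

    Hypothetical helper for illustration; not part of scraperwiki.
    """
    path = shutil.which(name)
    if path is None:
        message = "Required utility '{}' was not found on PATH.".format(name)
        if package_hint:
            message += " Try installing {}.".format(package_hint)
        raise RuntimeError(message)
    return path


if __name__ == "__main__":
    # Fail loudly instead of silently getting no XML back.
    try:
        print(require_tool("pdftohtml", package_hint="poppler-utils"))
    except RuntimeError as error:
        print(error)
```

Calling require_tool("pdftohtml", package_hint="poppler-utils") just before scraperwiki.pdftoxml(pdf_data) would turn the silent failure described above into an explicit error.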

StevenMaude commented 8 years ago

Thanks for reporting :+1:

navulurinaresh commented 7 years ago

Thanks a lot. Working perfectly.