chrisdev / pdftables

forked from the scraperwiki pdftables (0.0.4) project which was removed Github
13 stars 17 forks source link

Accuracy data #2

Open shankhs opened 9 years ago

shankhs commented 9 years ago

Hi,

Does anybody have any idea about the accuracy/efficiency of this library? Is there any future plan to update this project?

Thanks

chrisdev commented 9 years ago

@shankhs thanks for checking out this fork. As the README says I forked this from the ScraperWiki guys and they have removed their master from GitHub (or was it BitBucket?)

I was trying to see if i could use it for some internal projects but unfortunately the PDFminer Library upon which it is based was/is quite problematic. I suspect the same issues affected to ScraperWiki folks

I switched to using http://poppler.freedesktop.org in my internal project even though it was Python based. At least it was reliable and I hacked something together using subprocess calls.

However, I you have some ideas how to progress this project feel free to submit a pull request