documentcloud / docsplit

Break Apart Documents into Images, Text, Pages and PDFs
http://documentcloud.github.com/docsplit/
Other
833 stars 214 forks source link

Scrape data from a pdf document into CSV using docsplit ? #112

Closed anil-insonix closed 10 years ago

anil-insonix commented 10 years ago

Question:-Can we scrape data from a pdf document into CSV on ruby using docsplit ?

knowtheory commented 10 years ago

@insonix-ror depending on the sorts of PDFs you have our fine friends over at http://tabula.nerdpower.org/ might be able to help you out. Their repo is also over at https://github.com/jazzido/tabula