garysieling / pdf-js-csv

Exploring extracting tables from a PDF to CSV using PDF.JS
http://garysieling.com/blog/extracting-tables-from-pdfs-in-javascript-with-pdf-js
104 stars 26 forks source link

Make a nodejs version #10

Closed garysieling closed 11 years ago

simoncheeseman commented 11 years ago

This would be great! I am currently using a different module for parsing PDF tables and it's quite clunky...

garysieling commented 11 years ago

Which one are you using? That'd be helpful for feature comparison.

simoncheeseman commented 11 years ago

I'm using pdf2json (https://github.com/modesty/pdf2json). I only extract text strings and their x/y coordinates.

garysieling commented 11 years ago

Thanks!

garysieling commented 11 years ago

Took a first take on the node JS version, I'm testing from the command line so I suspect more work may be in order to use it in a webapp, but I need to come up with an example use case for that.