Open miltonlab opened 8 years ago
Some command to extrat tabular data from PDF to spreadshet or CSV or txt? pdftotext is not exact
This is not an easy problem. textract is a good project, but when it comes to PDFs, which can literally be images, OCR is sometimes required. Also, formatting is a major issue and no automated system will be perfect.
Some command to extrat tabular data from PDF to spreadshet or CSV or txt? pdftotext is not exact