jqnatividad / qsv

Blazing-fast Data-Wrangling toolkit
https://qsv.dathere.com
The Unlicense
2.52k stars 71 forks source link

Add PDF support #2203

Open jqnatividad opened 1 month ago

jqnatividad commented 1 month ago

Apart from CSV, TSV, TAB and spreadsheets, PDF - unfortunately, is a common format for distributing tabular data (e.g. financial statements).

Use https://github.com/sp1ritCS/tabula-rs/ to extract tabular data from PDFs.

rzmk commented 1 month ago

Their license is AGPL-3.0.

jqnatividad commented 1 month ago

Thanks @rzmk . Opened a ticket if it can be dual-licensed

https://github.com/sp1ritCS/tabula-rs/issues/4