Open fcfort opened 5 years ago
Next steps:
tabula-java
to take Betterment PDF and extract transaction tables.tabula-java
Java lib and calling from Chrome extension in the browser, i.e. test the java-lib -> JS -> Browserify -> Chrome extension pathway.
This project currently uses a homegrown hacky line-by-line PDF to text conversion in order to extract transaction data from Betterment PDFs. Would be much better if there we could use a more robust library designed to extract tabular data from PDFs.
See https://tomassetti.me/how-to-convert-a-pdf-to-excel/ along with Python impl (https://github.com/camelot-dev/camelot) and Java impl (https://github.com/tabulapdf/tabula-java).
One small problem is that there is no JS implementation. One possibility is to use a Java to JS transpiler, e.g. https://github.com/cincheo/jsweet or https://github.com/google/j2cl.