This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.
16
stars
35
forks
source link
First steps towards #33 - refactor encoding detection into etllib.py and leverage in tsvtojson.py #35
Hi @chrismattmann Thanks for the fix and the merge. Sorry I could not work on this sooner, as I was occupied with some other course work. Will pull the latest changes.
Hi @chrismattmann Thanks for the fix and the merge. Sorry I could not work on this sooner, as I was occupied with some other course work. Will pull the latest changes.