chrismattmann / etllib

This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.

First steps towards #33 - refactor encoding detection into etllib.py and leverage in tsvtojson.py #35

Closed: chrismattmann closed this 9 years ago

abhinandkr commented 9 years ago

Hi @chrismattmann, thanks for the fix and the merge. Sorry I could not work on this sooner; I was occupied with other coursework. I will pull the latest changes.