This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.
16
stars
35
forks
source link
fix for dupliate output for each unique record #27
Removed duplicated addition into the list.
https://github.com/chrismattmann/etllib/issues/25