This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.
I have added code to take input json files from a directory if -d option is given with the arguments. Hence, user can either give output as individual json files from stdin or provide a directory where json files reside.
I have added code to take input json files from a directory if -d option is given with the arguments. Hence, user can either give output as individual json files from stdin or provide a directory where json files reside.