sparsecode / DaFlow

Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Other
26 stars 13 forks source link
apache-spark avro csv etl etl-framework etl-pipeline hadoop hive join-data json parquet scala transformation-rules

DaFlow [Data Flow(ETL) Framework]

Build Status License codecov Code Climate

Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.