Shopify / camus

Kafka->HDFS pipeline from LinkedIn. It is a MapReduce job that does distributed data loads out of Kafka.

Create a new Azkaban task for deduplication. #112

Closed olessia closed 6 years ago