Shopify / camus

Kafka->HDFS pipeline from LinkedIn. It is a MapReduce job that does distributed data loads out of Kafka.

Report each file that is late, and the count of records in that file. #123

Closed (olessia closed 6 years ago)

olessia commented 6 years ago

Also refactor tests a bit to use tmp directories.
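
For readers unfamiliar with the pattern, here is a minimal sketch of a test writing into a throwaway temporary directory instead of a fixed path. It is not taken from this PR: the class name `TmpDirExample`, the helpers `createScratchDir` and `deleteRecursively`, the `camus-test-` prefix, and the `part-00000` file name are all hypothetical, and only standard `java.nio.file` calls are used.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.stream.Stream;

public class TmpDirExample {
    // Hypothetical helper: create a fresh scratch directory for one test.
    // The "camus-test-" prefix is an illustrative assumption.
    static Path createScratchDir() throws IOException {
        return Files.createTempDirectory("camus-test-");
    }

    // Delete the directory tree, visiting children before their parents.
    static void deleteRecursively(Path root) throws IOException {
        try (Stream<Path> paths = Files.walk(root)) {
            paths.sorted(Comparator.reverseOrder()).forEach(p -> {
                try {
                    Files.delete(p);
                } catch (IOException e) {
                    throw new UncheckedIOException(e);
                }
            });
        }
    }

    public static void main(String[] args) throws IOException {
        Path dir = createScratchDir();
        try {
            // A test would point the job's output at this directory.
            Path out = dir.resolve("part-00000");
            Files.write(out, "record\n".getBytes(StandardCharsets.UTF_8));
            System.out.println("wrote " + Files.size(out) + " bytes");
        } finally {
            // Clean up so repeated runs do not see stale files.
            deleteRecursively(dir);
        }
    }
}
```

Giving each test its own directory keeps runs independent of one another and of whatever already exists on the machine.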