Shopify / camus

Kafka->HDFS pipeline from LinkedIn. It is a MapReduce job that does distributed data loads out of Kafka.

Camus at shopify #24

Closed yagnik closed 9 years ago

yagnik commented 9 years ago

Try 2 of Camus at Shopify (long live Albert? I wish we had called it King, then I could say long live the king)

The following are changes that are included in this PR:

What we do not support yet: the Camus sweeper, which gets around the small-files problem. Upstream and my branch have diverged significantly, and upstream is refactoring the sweeper too. I'm in touch with the maintainers to get most of my changes in. We will support this soon enough.

Please review @drdee @airhorns cc @Shopify/data-engineers cc @wvanbergen @snormore @eapache

airhorns commented 9 years ago

:sheep: woooo!