cloudera / poisson_sampling

little improvements #1

Closed piccolbo closed 11 years ago

piccolbo commented 11 years ago

This change uses input formats, a feature new in 2.1, and recycling to simplify the code a bit. I hope you enjoy it. I am not expecting you to merge, since you probably want to keep your repo aligned with the blog post; this is just so that you can take a look at the changes.