aphyr / tesser

Clojure reducers, but for parallel execution: locally and on distributed systems.
872 stars 39 forks source link

Add support for cardinality estimation using HLL+ #7

Closed ghoseb closed 9 years ago

ghoseb commented 9 years ago

Add support for set cardinality using Hyperloglog+

Calculate cardinality of extremely large sets in constant space (with
tunable accuracy) using `tesser.cardinality/hll`.

An HLL+ instance can be used with `tesser.math/digest` xform.

HLL+ implements the Digest protocol from `tesser.quantiles`.

Add serialization support for HyperLogLog+

* Depends on upcoming version of `tesser.math` for xform support
* Add an explicit provided dep for HLL lib during compilation
* Upgrade `test.check` to latest version
aphyr commented 9 years ago

Cool, thanks!

ghoseb commented 9 years ago

Thank you sir! :-)