cocreature / thrill

Thrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++
http://project-thrill.org
Other
0 stars 0 forks source link

Initial (thrill) implementation #1

Closed cocreature closed 7 years ago

cocreature commented 7 years ago

I’ll try creating an initial implementation based on the algorithm presented in Figure 1 of the google paper until tomorrow (Tuesday 10.01.17) evening.

cocreature commented 7 years ago

I’ve pushed an implementation to the hyperloglog branch. The results seem to be somewhat reasonable, at least they scale correctly when I increment the number of distinct values but more testing is absolutely needed.