-
For those still looking for a (team) project idea, HyperLogLog is an interesting probabilistic data structure that is worth studying.
https://chengweihu.com/hyperloglog/
It would benefit from a …
-
Utilizar Hyper LogLog para analisar um alto volume de palavras para identificar trechos únicos
Referência : https://medium.com/botify-labs/hyperloglog-or-how-we-estimate-large-numbers-of-unique-url…
-
The unittest for the hyperloglog is instable and fails every now and then with this error:
```
___________________________ test_simple_hyperloglog ____________________________
def test_simple_hy…
-
**Is your feature request related to a problem? Please describe.**
I would like to be able to implement a GPU version of Spark's `approx_count_distinct` function, which uses the [HyperLogLog++](https…
-
HyperLogLog is an algorithm that provides 97% accuracy on cardinality counts (ie, unique visitors), while using a very small memory footprint. We could add support for this algorithm within Piwik to n…
-
@markharwood,
Great work on significant terms, maybe even greater visualization of the 4 strategies in your comment!
Working in the same space, yet having access to more detailed data, we have found…
-
_5.2.z_ (commit 3ee49cbb0927101c79e043fd7453e552704ba95c)
Failed on oracle-11: https://jenkins.hazelcast.com/job/Hazelcast-5.maintenance-sonar/431/testReport/junit/com.hazelcast.cardinality.impl.hy…
-
I would like to recommend HyperLogLog (HLL, or some of its useful variants, HLL++, LL beta) support as a native data type.
For a possible (Postgres) implementation see:
https://github.com/aggregatek…
-
RFC and the first PR: https://github.com/ClickHouse/ClickHouse/pull/53240
this issue is for dicussing what we will do in the future.
Use cases of column statistics:
- join reordering
- filter …
-
The following extensions need implementation for the upcoming 2.0.0 release.
#### Extensions to transfer from `main` branch
- [x] Citus
- [x] PostGIS
- [ ] #142
- [x] https://github.com/Samagra…