-
Hello Team,
It seems that the new cardinality estimator paper has two new estimators, the improved estimator and the MLE estimator, both better than the one originally proposed; see the paper here: https://arxiv.or…
-
Use HyperLogLog to analyze a high volume of words in order to identify unique passages.
Reference: https://medium.com/botify-labs/hyperloglog-or-how-we-estimate-large-numbers-of-unique-url…
-
- Benchmark testing for the current Sparse representation as part of `PFADD`.
- Analyze the feasibility of switching between Sparse and Dense representations based on the benchmark results.
- Implementation to swi…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/doris/issues?q=is%3Aissue) and found no similar issues.
### Description
The hll type can be imported through the hll_from_base64 function; apart from the hll type…
-
## Feature request
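I want to write the hll type directly, rather than importing detail rows and converting them with the HLL_HASH function.
We need to be able to write the intermediate-state types of aggregate functions such as hll directly, similar to ClickHouse's AggregateFunction type, which supports writing the intermediate aggregation state of any aggregate function: as long as the written byte array matches the serialization format of the corresponding aggregate function, it can be written directly.
When importing the hll type through the HLL_HASH function, there is too much raw data to import; directly…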
-
StarRocks supports `hll_cardinality`:
https://docs.starrocks.io/docs/sql-reference/sql-functions/scalar-functions/hll_cardinality/
However, regarding Trino compatibility, Trino supports the hll type's cardinality…
-
We found that the dense-encoding part of HyperLogLog can be significantly accelerated by SIMD instructions.
Our benchmark tests the performance of merging 3 dense hll structures.
```
pfcount key…
-
Hi Simple HLL team,
Is there a reason why ahash was used (I know ahash is fast and DoS-attack resistant) rather than xxh3 (the Rust version of xxhash) or wyhash-rs?
Thanks,
Jianshu
-
### Proposal
I would like to propose a new metric type: Distinct Count.
A distinct count records the number of unique things placed into a set. However, exact precision is not required for perf…
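To make the idea concrete, here is a minimal sketch of how an approximate distinct count could be backed by HyperLogLog. The class name `DistinctCount`, the precision `p = 12`, and the use of SHA-1 as the hash are all assumptions for illustration, not part of the proposal.

```python
import hashlib
import math


class DistinctCount:
    """Approximate distinct counter backed by HyperLogLog (hypothetical metric type)."""

    def __init__(self, p: int = 12):
        self.p = p                      # number of index bits (assumed default)
        self.m = 1 << p                 # number of registers
        self.registers = [0] * self.m

    def add(self, item: str) -> None:
        # 64-bit hash taken from the first 8 bytes of SHA-1 (an arbitrary choice here).
        h = int.from_bytes(hashlib.sha1(item.encode()).digest()[:8], "big")
        idx = h >> (64 - self.p)                  # top p bits select the register
        rest = h & ((1 << (64 - self.p)) - 1)     # remaining 64 - p bits
        # Rank = 1-based position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[idx] = max(self.registers[idx], rank)

    def estimate(self) -> float:
        alpha = 0.7213 / (1 + 1.079 / self.m)     # bias constant, valid for m >= 128
        raw = alpha * self.m ** 2 / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            return self.m * math.log(self.m / zeros)
        return raw
```

Adding the same item repeatedly leaves the registers unchanged, so memory stays fixed at `m` registers however many duplicates arrive. With `p = 12` the expected relative error is roughly 1.04 / sqrt(4096), about 1.6%.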
-
HLL currently uses 8 bits per register, which is not always required (it depends on the parameter `b`).
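As a rough sketch of how much could be saved, the following computes the minimum register width, assuming `b` is the number of hash bits used as the register index and that the hash is 64 bits wide (both are assumptions; the function names are hypothetical). A register stores a rank of at most `hash_bits - b + 1`, so it needs only enough bits to hold that value.

```python
def register_bits(b: int, hash_bits: int = 64) -> int:
    """Minimum bits per register for b index bits (hash width is an assumption)."""
    max_rank = hash_bits - b + 1      # largest value a register can hold
    return max_rank.bit_length()      # bits needed to represent 0..max_rank


def packed_size_bytes(b: int, hash_bits: int = 64) -> int:
    """Total size of 2**b registers when bit-packed at the minimum width."""
    m = 1 << b
    return (m * register_bits(b, hash_bits) + 7) // 8


if __name__ == "__main__":
    for b in (4, 10, 14, 16):
        unpacked = 1 << b             # one byte (8 bits) per register
        print(b, register_bits(b), packed_size_bytes(b), unpacked)
```

Under these assumptions, `b = 14` needs 6 bits per register, so the packed array takes 12288 bytes instead of 16384. The trade-off is that packed registers straddle byte boundaries, which makes reads and updates more expensive than plain byte indexing.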