-
### Motivation
the datasketches library has a new Unique Counting Sketch called CPC sketch that has better accuracy per size than HLL.
https://github.com/DataSketches/sketches-core/releases
…
pdeva updated
3 years ago
-
Documentation here : http://ekzhu.com/datasketch/lsh.html#minhash-lsh-at-scale
- Remove MinHashCustom class implementation
- Add MinHash LSH at scale : use of Redis database to store objects
-
-
Here is my code:
```
from difflib import ndiff
from time import perf_counter
from datasets import load_dataset
from datasketch import MinHash, MinHashLSH
import numpy as np
from model2vec imp…
-
When running in Python 3.12, get the following error:
```
datasketch/__init__.py:4: in
from datasketch.lsh import MinHashLSH
datasketch/lsh.py:7: in
from datasketch.storage import or…
ekzhu updated
8 months ago
-
**Use case**
Though Clickhouse support `uniqTheta` to estimate cardinality, the error in estimation is still so high in my case. I have so much free memory so i want to optimize the `uniqTheta` error…
-
Para las categorías de endémicas y migratorias ya que es un único valor, las gráficas de pie, dona o maptree no tienen sentido. Se podría dejar una gráfica con el mapa de Colombia en Coropletas indica…
-
El mapa de Amazonas está incompleto.
-
Hey,
Thanks for creating a rust binding for datasketch, I was looking for the same to use the datasketch between java & rust project
If possible can you provide a support for Deserialise / Seria…
-
Agregar botón para la descarga de datos como está en la versión actualmente al aire: