-
Hi, this package looks really cool and I'd love to use it for my use case.
I have about 7,000 sets with about 1,000 elements each that I'm using as my index. I also have a set of about 1,000 quer…
-
-
El pie de las gráficas dice que se puede ir a http://bit.ly/mobility-actions para ver el crédito completo pero no lleva a nada (link no existe). Además, hacer clic cerca del crédito (que es la acción …
-
The following is the discussion with Mayank on slack:
Mark: Hi Team, I have seen that in 0.4.0, pinot has implemented the initial version of theta-sketch based distinct count aggregation function, …
-
Estaba cambiando entre los filtros en un periodo de tiempo corto. Seleccione temática objeto de comercio y tipo de grupo Interés de conservación y se puso gris el explorador y se desconecto del servid…
-
For very large corpora, it would be good to have a database backend so things don't have to stay in memory.
-
Hi, this is a great work! I am trying to experiment with JOSIE to find joinable tables and unsure about the data pipeline. Could you briefly explain how to use this JOSIE codebase to find joinable tab…
-
hey, thanks for this great project.
I want to use min hash for my text embedding vectors which have both negative and positive numbers.
I have searched the issues and found that weighted min hash ca…
-
Thanks a lot for your work :) amazing job!
Are there any plans to create an implementation that can be parallelized across multiple threads (processes in Python)?
More context:
I have a large f…
-
Hello!
Firstly thanks for the library, very happy overall. We are currently using MinHashLSH with Cassandra backend to dedupe a large number of documents and are running into a niggling issue with …