-
I have a large number of embeddings (768 dimensions) which I am attempting to cluster. I was playing around with datasketch and WeightedMinHash to see if it was possible to use the resulting Jaccard d…
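Roughly what I was experimenting with, as a sketch (the data here is random stand-in data, and the clipping step is my own assumption, since weighted MinHash is only defined for non-negative weights):
```python
import numpy as np
from datasketch import WeightedMinHashGenerator

# Stand-in for the real embeddings: weighted MinHash needs non-negative
# weights, so negative components are clipped to zero here.
rng = np.random.default_rng(42)
embeddings = rng.standard_normal((1000, 768))
weights = np.clip(embeddings, 0.0, None)

wmg = WeightedMinHashGenerator(dim=768, sample_size=128, seed=1)
wm_a = wmg.minhash(weights[0])
wm_b = wmg.minhash(weights[1])

# Estimated weighted Jaccard similarity; one minus this is the distance
# that would feed into a clustering algorithm.
print(wm_a.jaccard(wm_b))
```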
-
Upload the PDF attached to this issue for the [press kit](https://cifras.biodiversidad.co/mas/prensa) download, and update the contact details to:
Sistema de Información sobre Biodiversidad de C…
-
Hi,
I'm trying to use the all_pairs() function to find all the (near-)duplicates in a set of about 14,000 text documents (after first turning them into ngram shingles). However, I'm running up against …
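For reference, the basic shape of this workflow as a sketch; I'm assuming `all_pairs()` from the SetSimilaritySearch package here, and the three-document corpus is a toy stand-in for the real 14,000:
```python
from SetSimilaritySearch import all_pairs

def shingles(text, n=3):
    # character n-gram shingles; word-level shingles work the same way
    return {text[i:i + n] for i in range(len(text) - n + 1)}

docs = ["the quick brown fox", "the quick brown fax", "lorem ipsum dolor"]
sets = [shingles(d) for d in docs]

# Yields (index_x, index_y, similarity) for every pair whose Jaccard
# similarity clears the threshold.
for x, y, sim in all_pairs(sets, similarity_func_name="jaccard",
                           similarity_threshold=0.5):
    print(x, y, sim)
```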
-
When I run the code provided in the example below, the dropdown menu to select a language is missing.
```
if (interactive()) {
  library("shiny")
  library("shi18ny")
  ui
```
-
Hi, I have a question about a large-scale LSH index. If I have billions of documents, I suppose even 1 TB of RAM is not enough to do in-memory LSH. Is there any recommended way to use datasketch for this sce…
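One option I'm aware of, as a sketch (assuming a Redis server on localhost:6379): datasketch's `storage_config` can keep the hash tables in Redis or Cassandra instead of process memory, so the index size is no longer bounded by one machine's RAM, though billions of documents would presumably still need a sharded backend:
```python
from datasketch import MinHash, MinHashLSH

# LSH index backed by Redis rather than in-process dictionaries.
lsh = MinHashLSH(
    threshold=0.8,
    num_perm=128,
    storage_config={
        "type": "redis",
        "basename": b"lsh_index",  # namespace for this index's keys
        "redis": {"host": "localhost", "port": 6379},
    },
)

m = MinHash(num_perm=128)
for token in [b"a", b"b", b"c"]:
    m.update(token)

lsh.insert("doc-0", m)
print(lsh.query(m))  # ['doc-0']
```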
-
Maybe a bit of a dumb question, but I'm a little confused by the `_insert` method in the `MinHashLSH` class:
```python
def _insert(
    self,
    key: Hashable,
    minhash: Union[Mi…
```
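For context, a small usage sketch of the public path that ends up in `_insert`: callers normally go through `insert()`, and `_insert` slices the hash values into bands and files the key under the hash of each band:
```python
from datasketch import MinHash, MinHashLSH

lsh = MinHashLSH(threshold=0.8, num_perm=128)

m = MinHash(num_perm=128)
for token in ["a", "b", "c"]:
    m.update(token.encode("utf8"))

# insert() is the public wrapper around _insert(); querying with the
# same MinHash returns the stored key as a candidate match.
lsh.insert("doc1", m)
print(lsh.query(m))  # ['doc1']
```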
-
I prepared 10 synthetic examples.
```python
import random

values = []
queries = []
count = 1
for _ in range(10):
    value = []
    for _ in range(100):
        value.append(count)
        …
```
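The integer lists then need to be turned into MinHash objects before indexing; `to_minhash` below is a hypothetical helper showing one way to do that:
```python
from datasketch import MinHash

def to_minhash(items, num_perm=128):
    # Hypothetical helper: hash each integer's string form into a MinHash.
    m = MinHash(num_perm=num_perm)
    for x in items:
        m.update(str(x).encode("utf8"))
    return m

a = to_minhash(range(1, 101))   # e.g. one synthetic value set
b = to_minhash(range(51, 151))  # shares 50 of its 100 elements with `a`
print(a.jaccard(b))             # estimate of the true Jaccard, 50/150
```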
-
Hi, this package looks really cool and I'd love to use it for my use case.
I have about 7,000 sets with about 1,000 elements each that I'm using as my index. I also have a set of about 1,000 quer…
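To make the question concrete, here is roughly the shape of what I'm doing, as a sketch with much smaller, made-up sets (the real index has about 7,000 sets of about 1,000 elements each):
```python
from datasketch import MinHash, MinHashLSH

NUM_PERM = 128

def to_minhash(items):
    m = MinHash(num_perm=NUM_PERM)
    for x in items:
        m.update(str(x).encode("utf8"))
    return m

# Made-up stand-ins for the index sets and the query sets.
index_sets = {f"set-{i}": set(range(i * 100, i * 100 + 1000))
              for i in range(100)}
query_sets = [set(range(i * 100 + 50, i * 100 + 1050)) for i in range(10)]

lsh = MinHashLSH(threshold=0.5, num_perm=NUM_PERM)
for key, s in index_sets.items():
    lsh.insert(key, to_minhash(s))

for q in query_sets:
    print(lsh.query(to_minhash(q)))  # keys of candidate near-matches
```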