count-min-sketch Search Results

1000+ results for count-min-sketch


cegme/vose #2

Deduplication of marbles

Add a function that dedupes the repeated marble labels. This is like a reduce function that sums the duplicate weights. You can't really merge marbles, but let's pretend that they are play-doh balls. T…

cegme updated 11 years ago
1
mitdbg/aurum-datadiscovery #52

Faster indexing

- Check how to improve Elasticsearch's performance
- Build a pre-indexer that filters out data that has been indexed for a given column. Basically this requires a count-min sketch per column, so that …

raulcf updated 7 years ago
4
dib-lab/khmer #1103

rename default file extensions to match new nomenclature?

As documented in https://github.com/dib-lab/khmer/blob/doc/binaryformats/doc/dev/binary-file-formats.rst we have too many names for the same things: 1. countgraph/countinghash/count-min sketch with fi…

mr-c updated 7 years ago
6
spotify/scio #2161

Approximate top-K frequent items

@idreeskhan pointed me to Space Saving and other variants, approximate algorithms that can answer top K items & frequencies. Could be nice to have. Right now we have Count-Min Sketch from Algebird …

nevillelyh updated 5 years ago
1
vectordotdev/vector #18709

Unable to detect 'sketch' data types in vector vrl language

### A note for the community * Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to …

seanlowcy77 updated 10 months ago
2
alecmocatta/streaming_algorithms #14

Is this abandoned?

There are dependency updates like the rand one from dependabot not merged. It looks like a useful crate but I'm trying to assess whether I'd need to fork it or whether it is still maintained.

rbtcollins updated 1 year ago
1
prestodb/presto #21469

Runtime Metrics for skewed key of a join

It would be great if we could add a runtime metric that outputs the skewed keys of joins. Maybe we could use a count-min sketch or related data structures in the lookup join operator itself to detect…

jaystarshot updated 4 months ago
2
twitter/algebird #497

CountMinSketch[K] assumes working equals method on K.

For small count-min sketches, you create CMSItem: https://github.com/twitter/algebird/blob/develop/algebird-core/src/main/scala/com/twitter/algebird/CountMinSketch.scala#L467 But to get the frequenc…

johnynek updated 7 years ago
11
microsoft/hyperspace #441

[PROPOSAL]: Data Skipping Indexes

## Problem Statement Add support for data skipping indexes. ## Background and Motivation Hyperspace has been supporting hash-partitioned covering indexes only. Covering indexes are good for s…

clee704 updated 3 years ago
9
pingcap/tidb #40469

collect table statistics when importing data by Lightning

## Enhancement Currently, after we import data to the cluster, we need to analyze the table, which is time-consuming since it needs to scan the whole table. Collecting table statistics can be done …

xuyifangreeneyes updated 1 year ago
1
