simhash Search Results - Githubissues

422 results
for simhash

prikevs/simhash-demo #1

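The simHash function shifts one bit too many when computing res, doesn't it?

As a result, res has 33 bits, but only the lower 32 bits take part in the Hamming distance calculation, so one valid bit is dropped and one invalid bit is included.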
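
The report above can be illustrated with a minimal sketch. The repository's actual code is not shown in this listing, so everything here is an assumption: the function names, the use of `zlib.crc32` as a stand-in 32-bit token hash, and the exact shape of the extra shift are hypothetical and only reproduce the symptom described (a 33-bit `res` whose lower 32 bits are compared).

```python
import zlib

BITS = 32  # fingerprint width, assumed from the issue text


def token_hash(token: str) -> int:
    # Stand-in 32-bit hash (assumption); crc32 returns an unsigned 32-bit int.
    return zlib.crc32(token.encode("utf-8"))


def bit_votes(tokens, bits=BITS):
    # For each bit position, add +1 if the token hash has the bit set, else -1.
    votes = [0] * bits
    for tok in tokens:
        h = token_hash(tok)
        for i in range(bits):
            votes[i] += 1 if (h >> i) & 1 else -1
    return votes


def simhash(tokens, bits=BITS):
    # Intended behavior: bit i of the result is 1 when the vote for bit i is positive.
    res = 0
    for i, v in enumerate(bit_votes(tokens, bits)):
        if v > 0:
            res |= 1 << i
    return res


def simhash_extra_shift(tokens, bits=BITS):
    # Hypothetical buggy variant: shifting after every bit, including the
    # last one, leaves the fingerprint moved one position to the left.
    votes = bit_votes(tokens, bits)
    res = 0
    for i in range(bits - 1, -1, -1):  # most significant bit first
        if votes[i] > 0:
            res |= 1
        res <<= 1  # the final iteration performs one shift too many
    return res


def hamming(a: int, b: int, bits=BITS) -> int:
    # Compare only the lower `bits` bits, as the issue says the demo does.
    return bin((a ^ b) & ((1 << bits) - 1)).count("1")
```

In this sketch `simhash_extra_shift(tokens)` equals `simhash(tokens) << 1`: the lowest bit is always 0 and carries no information, while the top fingerprint bit lands in position 32 and is masked out by `hamming`.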

finalgene updated 3 years ago
1
edgi-govdata-archiving/web-monitoring-diff #16

Try to identify situations where parts of a page have *moved…

One situation we see that can be confusing for analysts is when a portion of the page has *moved* by swapping locations with another (maybe two paragraphs get reversed, maybe the navigation moves from…

Mr0grog updated 3 years ago
7
htm-community/htm.core #259

Extra Encoders (Image, Video, sound, ...)

For some experiments, I'd like to set up `encoders/extra/{vision,audio,...}/` with specialized encoders for multiple modalities. There used to be dedicated repos for this, such as https://github.com/htm-community…

breznak updated 5 years ago
6
ghostwords/chameleon-crawler #7

Store fingerprinting scripts

It would be really great to be able to store a copy of all the scripts identified as fingerprinting scripts. That way we could see if any scripts are commonly being used by different attackers. This c…

cooperq updated 9 years ago
7
lanmaster53/recon-ng-marketplace #80

Intelx.io

**Is the feature request related to a tool? Please describe.** From the [site](https://intelx.io/): ``` Intelligence X allows you to perform a search for these selector types: - Email address …

cam-barts updated 4 years ago
3
dpwe/audfprint #88

Can this algorithm load the historical features into memory …

Can this algorithm load the historical features into memory first so that matching is faster? I don't know how to modify your base code to do this.

xuboot updated 2 years ago
7
antlabs/strsim #1

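Goals and references

## Goals * Implement a string similarity lib in Go * Handle Chinese text with high accuracy (many libraries written by non-Chinese developers currently handle Chinese poorly) * Integrate multiple similarity algorithms (edit distance, Hamming code, Dice coefficient) ## Levenshtein edit distance * https://zhuanlan.zhihu.com/p/91667128 * https://www.jianshu.com/p/a617d20162cf (以…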

guonaihong updated 1 year ago
12
huggingface/datasets #6007

Get an error "OverflowError: Python int too large to convert…

### Describe the bug When loading a large dataset with the following code ```python from datasets import load_dataset dataset = load_dataset("liwu/MNBVC", 'news_peoples_daily', split='train') ``…

silverriver updated 8 months ago
8
ravendb/ravendb #16603

[Feature] Add vector similarity search support

If I want to store feature vectors (a numeric array, e.g. `[2.01, 20.85, 14.05]`) in the DB, I'd like to query other records (with arrays of the same dimension) similar to the selected one(s) with a c…

AKlaus updated 9 months ago
3
TraceMachina/nativelink #901

Add a delta store (similar to `DedupStore`)

Inspiration is taken from how git packs achieve high compression rates with good random access performance. The 2 key components are (1) a clustering strategy to store similar objects together and (2)…

barrbrain updated 5 days ago
13
