weighted-damerau-levenshtein Search Results

infoscout/weighted-levenshtein #28

segmentation fault on Linux

I just tried to run a simple example with weighted-levenshtein on Linux (Fedora 36) with Python3.9 and Python3.10, but I did run into a segmentation fault. I installed the library using: ``` pip in…

maxbachmann updated 2 years ago

wolfgarbe/SymSpell #43

Support for weighted edit distance

I'm not sure if SymSpell already has support for weighted edit distance. If so, please tell me how to use it. Otherwise, I suggest to add this as another possible distance metric, in addition to Le…

heatherleaf updated 6 years ago

wolfgarbe/SymSpell #136

OCR spelling mistakes

What is the recommended practice for OCR typos that come from say poor kerning? Examples below. mformation --> information wntmg --> writing The problem I have is that SymSpell `lookup_compound…

statzhero updated 1 year ago

alphagov/openregister-picker-engine #3

Levenshtein distance and other algorithms

It could significantly slim down the data file graph if we could infer some of the typos using [string metric algorithms](https://en.wikipedia.org/wiki/String_metric) or other programmatic ways.

tvararu updated 7 years ago

infoscout/weighted-levenshtein #16

Jupyter notebook crashes when using dam_lev with transpose_c…

I'm using dam_lev in a jupyter notebook (5.4.0). Python 3.6.4 |Anaconda, Inc.| (default, Jan 16 2018, 10:22:32) [MSC v.1900 64 bit (AMD64)]. My OS is Windows 10. Running the code below, I get the erro…

BobbyClouser updated 4 years ago

elastic/elasticsearch #24655

Make suggester's string distance pluggable or configurable

### Summary The current implementation allows choosing between of Damerau-Levenshtein algorithm (2 implementations), Levenshtein algorithm, Jaro-Winkler algorithm, or ngram-based algorithm with non…

imotov updated 3 months ago

wolfgarbe/SymSpell #87

[Question] Few queries regarding upcoming changes, benchmark…

Thanks for maintaining this excellent library. I am currently contributing to the rust clone of this. I have a few queries. 1. In the following upcoming change. Can you elaborate on how the pigeon …

sai-prasanna updated 1 year ago

henryliangt/usyd #60

Distances

There are several ways to compute the distance between two arrays, each with its unique characteristics and use cases. The most common ones include: **Euclidean Distance**: This is probably the mos…

henryliangt updated 11 months ago

tesseract-ocr/tesseract #3560

output true CER for checkpoints (at least the final one)

AFAICS, `lstmtraining` produces two types of figures for measuring the error: 1. **bag-of-character training error** (on `list.train`): this is shown as - `char train=%.3f%%` every 100 iterations…

bertsky updated 2 years ago

moj-analytical-services/splink #2006

[FEAT] Exact match level required for TF adjustment - can th…

### Is your proposal related to a problem? I have an application where I wish to link dataset A, representing noisy transactions, to dataset B, a master entity list. For dataset B (the master list)…

samkodes updated 8 months ago

25 results for weighted-damerau-levenshtein

25 results
for weighted-damerau-levenshtein