-
I'm a data engineer at [IPUMS](ipums.org) and am testing out this library in order to link US historical decennial censuses. The problem I'm running into is that I need to know the indices of the matc…
-
From time to time, I have difficulty building the native extension in CI environments.
In this use case, I don't really care about performance, so the pure Python version would be more than enough.
…
-
BPO | [38530](https://bugs.python.org/issue38530)
--- | :---
Nosy | @vstinner, @aroberge, @serhiy-storchaka, @1st1, @pablogsal, @tirkarthi, @isidentical, @sweeneyde
PRs | python/cpython#16850, python/cp…
-
I see that web/Makefile is used to generate the documentation.
Theoretically this file is not necessary.
- setuptools does not know where `copy.py` is, and it fails to generate the documentation using `bu…
-
Have you considered other algorithms such as Smith-Waterman, Jaro-Winkler, or even the newer/improved Damerau-Levenshtein?
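
For reference, here is a minimal pure-Python sketch of one of the variants mentioned above: the restricted Damerau-Levenshtein distance (often called optimal string alignment, OSA). The function name `osa_distance` is hypothetical and is not taken from any library; this only illustrates how an adjacent transposition is counted as a single edit.

```python
def osa_distance(s: str, t: str) -> int:
    """Restricted Damerau-Levenshtein (optimal string alignment) distance.

    Counts insertions, deletions, substitutions, and transpositions of two
    adjacent characters. Hypothetical helper for illustration only.
    """
    rows, cols = len(s) + 1, len(t) + 1
    # d[i][j] = distance between the first i chars of s and first j chars of t
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all i characters
    for j in range(cols):
        d[0][j] = j  # insert all j characters

    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if s[i - 1] == t[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
            # Adjacent transposition, e.g. "ab" -> "ba"
            if (
                i > 1
                and j > 1
                and s[i - 1] == t[j - 2]
                and s[i - 2] == t[j - 1]
            ):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


if __name__ == "__main__":
    print(osa_distance("abcd", "acbd"))       # one transposition
    print(osa_distance("kitten", "sitting"))  # no transpositions involved
```

Note that OSA never edits the same substring twice, so it can differ from the unrestricted Damerau-Levenshtein distance on inputs such as `"ca"` vs. `"abc"`.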
-
**Options for computing distance between 2 character sequences in python:**
- textdistance: full python lib
- jellyfish: full python lib
- strsimpy: full python lib
- python-Levenshtein: C lib
…
-
This is a minor thing, but it would improve usability.
**(1) Issue/Bug Description:**
Launcher program search should prioritize matches on the first part of a word.
**(2) Steps to reproduce (if y…
-
In the case of matching via partial_ratio between the following strings:
"GREAT GREEK 70005"
"GREAT GREEK MEDITERRANEAN GRILL WINTER GARDEN"
A partial score of 71% is expected, but RapidFuzz retu…
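
To show where the 71% figure can come from, here is a naive sliding-window sketch built only on the standard library's `difflib.SequenceMatcher`. It is an assumption about how a "partial ratio" is conventionally defined (best ratio of the shorter string against every same-length window of the longer one); it is not RapidFuzz's actual implementation, and `naive_partial_ratio` is a hypothetical name.

```python
from difflib import SequenceMatcher


def naive_partial_ratio(a: str, b: str) -> float:
    """Best similarity (0-100) of the shorter string against every
    equal-length window of the longer string.

    Illustrative sketch only; real libraries use faster and more
    refined alignment strategies.
    """
    shorter, longer = (a, b) if len(a) <= len(b) else (b, a)
    if not shorter:
        return 0.0

    m = len(shorter)
    best = 0.0
    for start in range(len(longer) - m + 1):
        window = longer[start:start + m]
        # ratio() = 2 * matched_chars / (len(shorter) + len(window))
        score = SequenceMatcher(None, shorter, window, autojunk=False).ratio()
        best = max(best, score)
    return 100.0 * best


if __name__ == "__main__":
    s1 = "GREAT GREEK 70005"
    s2 = "GREAT GREEK MEDITERRANEAN GRILL WINTER GARDEN"
    print(round(naive_partial_ratio(s1, s2)))
```

With the two strings above, the best window is the leading `"GREAT GREEK MEDIT"`, which shares the 12-character prefix `"GREAT GREEK "` with the 17-character query, giving 2 × 12 / 34 ≈ 0.706.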
-
Dear,
The package is great! Fast and easy to use.
I have a problem for which I believe parallelization is the best option. Could you share some documentation on the best way to parallelize str…
-
Greatly appreciate the work on this package. Our data deals with a wide diversity of names (Hispanic, Asian, etc.), and we've found that the string distance methods included with fastLink have occasio…