-
This issue tracks missing components in the Rust port:
- [ ] Levenshtein
- [x] basic distances
- [x] cached distances
- [ ] simd implementation
- [ ] edit operations
- [x] Damera…
-
https://aclanthology.org/C08-2006.pdf
-
- Link the series folders on disk with metadata from the db
- Endpoint that exposes this info
-
### Is your feature request related to a problem?
Feature request #2385 discusses fuzzy string matching that was implemented in PR #1904.
@tobiemh mentioned three not-yet-implemented string distance…
-
Hi, I have some questions about this library if you don't mind:
1. If I'm correct, you keep the word list in memory managed by a process
- the included word list contains 40k+ entries, and in addi…
-
Now that the package is getting more mature, it would be nice to add support for other distance metrics (specifically, hamming and cosine distances). These should be relatively easy to implement follo…
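
For reference, here is a minimal sketch of the two metrics in plain Python. It is not taken from the package: the function names `hamming_distance` and `cosine_distance` are hypothetical, and the cosine variant assumes strings are compared through character n-gram counts (bigrams by default), which is only one of several possible vectorizations.

```python
import math
from collections import Counter


def hamming_distance(a: str, b: str) -> int:
    """Count the positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires equal-length inputs")
    return sum(x != y for x, y in zip(a, b))


def cosine_distance(a: str, b: str, n: int = 2) -> float:
    """Return 1 - cosine similarity of character n-gram count vectors.

    Assumption: each string is represented by its character n-gram counts.
    """
    grams_a = Counter(a[i:i + n] for i in range(len(a) - n + 1))
    grams_b = Counter(b[i:i + n] for i in range(len(b) - n + 1))
    # Dot product over the n-grams of `a`; Counter returns 0 for missing keys.
    dot = sum(count * grams_b[gram] for gram, count in grams_a.items())
    norm_a = math.sqrt(sum(c * c for c in grams_a.values()))
    norm_b = math.sqrt(sum(c * c for c in grams_b.values()))
    if norm_a == 0 or norm_b == 0:
        # A string shorter than n has no n-grams; treat it as maximally distant.
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)
```

Hamming distance is only defined for inputs of equal length, so the sketch raises instead of padding or truncating; a real implementation might choose a different policy.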
-
Hi,
I was reading this documentation (http://sampleclean.org/guide/) and I see that you can use any similarity metric to find the similarity between two strings on one column attribute. Can you use m…
-
Is this intended behavior?
```
user=> (require '[clj-fuzzy.jaro-winkler])
nil
user=> (clj-fuzzy.jaro-winkler/jaro-winkler nil nil)
0.4
```
-
Hey Charlie, this is Mark from the HG team. I installed LLString by cloning the repo and then running `pip install LLString/llstring`, but I get a problem when I try to import it. This is not a problem…
-
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8421191/ e.g.
We are already kind of doing very basic word2vec with the numpy version of remove_redundancies that Adam wrote; can we use an algorithm t…