vickumar1981 / stringdistance

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
https://vickumar1981.github.io/stringdistance/api/com/github/vickumar1981/stringdistance/index.html
Other
75 stars 15 forks source link

Allow access to underlying n-gram tokenizer #28

Closed vickumar1981 closed 4 years ago

vickumar1981 commented 5 years ago

The N-gram implementation: https://github.com/vickumar1981/stringdistance/blob/master/src/main/scala/com/github/vickumar1981/stringdistance/impl/NGramImpl.scala

vickumar1981 commented 4 years ago

Added functions to n-gram classes: https://github.com/vickumar1981/stringdistance/pull/46

vickumar1981 commented 4 years ago

Included in version 1.2.0