A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
The tests for the phonetic similarities - soundex and metaphone are sparse. Need to add more tests/examples to make sure those are working as expected.
The tests for the phonetic similarities - soundex and metaphone are sparse. Need to add more tests/examples to make sure those are working as expected.
https://github.com/vickumar1981/stringdistance/blob/master/src/test/scala/fixtures/TestSoundCases.scala#L8
Can also enable/use (currently, disabled for phonetic similarities only) the coverage report to see what part of the algorithms is untested.