rockymadden / stringmetric

:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
https://rockymadden.com/stringmetric/
486 stars 81 forks source link

Dice / Sorensen Metric - Incorrect Result? #29

Open AnanthaRajuC opened 7 years ago

AnanthaRajuC commented 7 years ago

for dice/sorensen metric, your example of "night", "natch" you are showing a value of .60, however according to the algorith the expected answer for the same two strings is .25

could you please elaborate on how you got .60?

links for reference: https://en.wikipedia.org/wiki/S%C3%B8rensen%E2%80%93Dice_coefficient

http://www.gutenberg.us/articles/eng/S%C3%B8rensen_similarity_index

http://wooorm.com/dice-coefficient/