dgrtwo / fuzzyjoin

Join tables together on inexact matching

Normalized string distance? #45

Open jzadra opened 6 years ago

jzadra commented 6 years ago

Is there any way to match on normalized string distance?

I.e., for osa, lv, and dl, dividing the distance by the nchar of the longer string, or for lcs, dividing by the combined length of both strings?

Being 1 letter off in a 4-letter string is quite different from being 1 letter off in a 20-character string.
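
To make the request concrete, here is a minimal sketch of the normalization described above. It is written in plain Python purely as an illustration; it is not fuzzyjoin or stringdist code, and the function names (`levenshtein`, `lcs_distance`, `normalized_lv`, `normalized_lcs`) are hypothetical. It assumes "lv" means plain Levenshtein distance and "lcs" means the insert/delete-only distance, and it treats two empty strings as distance 0.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, substitute), row-by-row DP."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def lcs_distance(a: str, b: str) -> int:
    """Insert/delete-only distance: len(a) + len(b) - 2 * LCS length."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            if ca == cb:
                curr.append(prev[j - 1])
            else:
                curr.append(min(prev[j], curr[j - 1]) + 1)
        prev = curr
    return prev[-1]


def normalized_lv(a: str, b: str) -> float:
    """Levenshtein distance divided by the length of the longer string."""
    longest = max(len(a), len(b))
    return 0.0 if longest == 0 else levenshtein(a, b) / longest


def normalized_lcs(a: str, b: str) -> float:
    """LCS distance divided by the combined length of both strings."""
    total = len(a) + len(b)
    return 0.0 if total == 0 else lcs_distance(a, b) / total


# One letter off in a 4-letter string vs. a 20-character string:
short = normalized_lv("cart", "card")                    # 1 / 4  = 0.25
long_ = normalized_lv("a" * 19 + "t", "a" * 19 + "d")    # 1 / 20 = 0.05
```

With a threshold on the normalized value (say 0.1, an arbitrary choice), the 20-character pair would match and the 4-letter pair would not, even though both have a raw distance of 1.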