Open yosimaurer opened 6 years ago
FuzzyMatches is based on Latin character set. See https://github.com/tylerjensen/FuzzyStrings/blob/master/src/DuoVia.FuzzyStrings/DuoVia.FuzzyStrings/StringExtensions.cs#L53
I'm open to a pull request to resolve that for other character sets.
@yosimaurer the primary problem would be the DoubleMetaphone algorithm that relies on Latin character set and generic English pronunciation. To support another language/character set, you would need to extend that algorithm and modify the FuzzyMatches mashup algo that uses all four of the base algorithms.
FuzzyEquals and FuzzyMatches does not work with hebrew charachters - and seem to ignore them.
However other fuzzy methods like DiceCoefficient work well.
sample code:
string str1 = "אבג"; string str2 = str1;
Results:
False -0.0625 True 0.999999