maxharlow / csvmatch

🔎 Finds fuzzy matches between CSV files
Other
183 stars 22 forks source link

Parameters for blocking #11

Closed maxharlow closed 7 months ago

maxharlow commented 7 years ago

One option to specify fields to use to create the blocks, another (?) to set the method -- default to exact match, options for Metaphone etc

Interesting bit on Soundex plus Levenshtein here: https://info.crunchydata.com/blog/fuzzy-name-matching-in-postgresql