luozhouyang / python-string-similarity

A library implementing different string similarity and distance measures using Python.
MIT License
991 stars 127 forks source link

Added SIFT4 #23

Closed matthijsz closed 4 years ago

matthijsz commented 4 years ago

I've added a Python implementation of SIFT4 as described here: https://siderite.dev/blog/super-fast-and-accurate-string-distance.html/ The code included in this fork is a Python translation of the JavaScript code for Extended/General SIFT4 described on the website. Usage is similar to the existing methods of the package, i.e.: a = SIFT4() a.distance('abcdef', 'bcdea')