xdrop / fuzzywuzzy

Java fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java
GNU General Public License v2.0
822 stars 118 forks source link

add similarity sort algorithm #107

Open numdouglas opened 1 month ago

numdouglas commented 1 month ago

The current sort algorithm used to sort the tokenised strings is alphabetical sort. This has shortcomings such as when two similar words begin with different letters. I propose the addition of Levenstein sort on the tokens themselves to improve the accuracy in such instances.