Closed sanchez575 closed 9 years ago
The thresholdLCS function in /clean/deduplication/PassJoin.scala returns threshold + 1 when the edit distance between two strings exceeds threshold, which differs from the behavior of the thresholdLevenshtein function.
I don't know whether this difference causes any problems.
It seems that thresholdLevenshtein is never used, so I think we should remove it from PassJoin.scala.
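For anyone comparing the two functions, here is a minimal sketch of the "return threshold + 1 when the distance is over the limit" convention described for thresholdLCS. It is not the code from PassJoin.scala: the object and function name boundedEditDistance are hypothetical, and it assumes a plain Levenshtein recurrence rather than whatever thresholdLCS does internally.

```scala
object BoundedEditDistance {
  // Hypothetical helper for illustration only (not the PassJoin.scala code).
  // Returns the Levenshtein distance between a and b if it is <= threshold,
  // otherwise returns threshold + 1.
  def boundedEditDistance(a: String, b: String, threshold: Int): Int = {
    require(threshold >= 0, "threshold must be non-negative")
    val cap = threshold + 1
    // The length difference is a lower bound on the edit distance.
    if (math.abs(a.length - b.length) > threshold) cap
    else {
      // prev(j) holds the distance between the empty prefix of a and b.take(j).
      var prev = Array.tabulate(b.length + 1)(j => j)
      var exceeded = false
      var i = 1
      while (i <= a.length && !exceeded) {
        val curr = new Array[Int](b.length + 1)
        curr(0) = i
        var rowMin = i
        var j = 1
        while (j <= b.length) {
          val cost = if (a(i - 1) == b(j - 1)) 0 else 1
          curr(j) = math.min(math.min(prev(j) + 1, curr(j - 1) + 1), prev(j - 1) + cost)
          rowMin = math.min(rowMin, curr(j))
          j += 1
        }
        // Row minima never decrease, so once a whole row is above the
        // threshold the final distance must be above it as well.
        exceeded = rowMin > threshold
        prev = curr
        i += 1
      }
      if (exceeded) cap else math.min(prev(b.length), cap)
    }
  }
}
```

With this convention a caller only needs to check whether the result is at most threshold; any larger value means "not within threshold" and does not report the true distance.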
Added Edit Distance.