Closed dfalster closed 4 months ago
@ehwenk Interesting bit from the stingdist
help files
The metric you need to choose for an application strongly depends on both the nature of the string (what does the string represent?) and the cause of dissimilarities between the strings you are measuring. For example, if you are comparing human-typed names that may contain typo's, the Jaro-Winkler distance may be of use. If you are comparing names that were written down after hearing them, a phonetic distance may be a better choice.
@wcornwell @dfalster
So much for my hour of coding - immediately the same output with stringdist, method = "dl". At least now I know my logic has an official name.
I'll remove my "wordy" code and run all tests.
interestingly, this probably makes #162 obsolete. Looks like part of their C++ magic is low-level parallelization.
see: https://www.rdocumentation.org/packages/stringdist/versions/0.9.12/topics/stringdist-parallelization
As noted by @wcornwell in https://github.com/traitecoevo/APCalign/issues/180#issuecomment-2087805540, we can swap out
adist
for a faster alternative