dgrtwo / fuzzyjoin

Join tables together on inexact matching
Other
668 stars 61 forks source link

Feature request: report distances from stringdist_join() and friends #10

Closed lmullen closed 8 years ago

lmullen commented 8 years ago

For the types of fuzzy joins that have some kind of a distance metric (e.g., a string distance for stringdist_join() and friends), would it be possible to have an option to return the distance measure as a new column in the resulting data frame?

dgrtwo commented 8 years ago

Good idea!

dgrtwo commented 8 years ago

Added distance_col option to those functions (and ability for match_fun in fuzzy_join to return multiple columns generally). Let me know how it works out!

lmullen commented 8 years ago

Works brilliantly on some test data. I'll give it a shot with the real data. Thanks so much!