Open abhennessy99 opened 2 years ago
If a messy value has no match in clean_vector below the threshold, the cat_join output does not keep it: the value is deleted and replaced with NA.
data("clean_caterpillars")
data("messy_caterpillars")
cat_join(messy_df = messy_caterpillars, clean_df = clean_caterpillars, by = c("CaterpillarSpecies", "species"), method = "jaccard", threshold = .49, join = "full")
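For comparison, here is a rough base-R sketch of what I would expect a full join to do with an unmatched value. The data frames are made up, and `best_match` is a hypothetical stand-in for whatever the matching step returns (it is not the package internals). The point is only that the original messy value stays visible next to the NA match.

```r
# Toy data (made up; not the package's clean_caterpillars / messy_caterpillars).
messy <- data.frame(
  CaterpillarSpecies = c("Papilio glaucuss", "unknown moth"),
  count = c(3, 1),
  stringsAsFactors = FALSE
)
clean <- data.frame(
  species = c("Papilio glaucus", "Danaus plexippus"),
  host = c("tulip tree", "milkweed"),
  stringsAsFactors = FALSE
)

# Assumed result of the matching step: NA marks a messy value
# with no match under the threshold (hypothetical, for illustration).
best_match <- c("Papilio glaucus", NA)

# Store the match in its own column so the messy value is not overwritten.
messy$species <- best_match

# Full join: keeps matched rows, unmatched messy rows, and unmatched clean rows.
joined <- merge(messy, clean, by = "species", all = TRUE)
print(joined)
```

In this sketch "unknown moth" survives in the CaterpillarSpecies column with NA in species, rather than disappearing from the output.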
Added this to the testing document; will try to fix it tomorrow.