Open systemnova opened 2 years ago
Thanks for this! The "takeout" setting is not vectorized; it only takes a vector of length 1 (I will make a note to add this to the documentation.) You can only have it take out matches (or not) for the entire tier_match, not each tier.
To do what you want, you'll simply need to run a few different tier_match calls, each with different settings for takeout.
Thanks Chris, I will split up the matching into a few tier_match calls. However, separate to the non-vectorised takeout, it seems to not be consistently applying takeout = "both" across all tiers when not vectorised. For example, the following match evaluation result was produced from takeout = "both". It seems to work for the first three tiers, but doesn't remove results in tier 4, with 150 matches? Potentially because tier 4 swaps to be multivar match, after two fuzzy match tiers?
Thanks again for creating such an awesome package.
Ah, I see! Thanks for pointing this out. I am unfortunately headed out for vacation for a few weeks, so I won't be able to dive into this for a while. I am guessing that you're simply correct, and that it doesn't correctly pull out matches from multivar. For now, you'll just need to continue breaking up the steps. So you do your first three tiers in one tier match, then run a line to remove those ids from the data, then do a merge_plus for each multivar match that you want.
I apologize for this bug, and appreciate your help figuring it out.
I'm getting a few unexpected behaviours for tier_match using the code below. The results are ok for the first two tiers, but then on the later tiers it's ignoring the 'top =1' and the 'takeout=both' setting, and instead returning many more matches than rows in the tier called 'd_multi'? I also have a feeling I'm not using the takeout argument correctly, as the documentation indicates that it takes a character vector, but I cant get it to work with a vector of length >1? Is there a way to control how the takeouts work for each tier? Thanks for making such a powerful & useful set of functions!