Since the script automatically merges exactly duplicate rows, it needs to be careful about overzealously merging rows that have too much missingness to reliably ensure that they refer to the same identity. E.g., the script currently looks for this:
last_name is not null AND
(ssn is not null
OR (first_name is not null AND dob is not null))
Since the script automatically merges exactly duplicate rows, it needs to be careful about overzealously merging rows that have too much missingness to reliably ensure that they refer to the same identity. E.g., the script currently looks for this: