Closed carrollsa closed 5 months ago
Local testing on smaller datasets has worked as intended. Full dataset yields the following results. I'm fairly confident we aren't deleting anything unintended, but I'm still running another test to be sure.
Made a printout to show the contacts that would be deleted when running on prod data:
I just updated it to also include Jane Does, as prod data has 3. There are also instances of first_name = "NONAME"
, last_name = "(RED FLAG)"
and a julia NONAME
.
(RED FLAG)
but leaning toward a no. Even though red flag implies they want to track this person for some red flag reason, I don't think they need this person in salesforce.Latest changes will remove the following:
Closes https://github.com/CodeForPhilly/paws-data-pipeline/issues/588
Changes
pdp_contacts
before matching process beginsConsiderations
pdp_contacts
as a separate step before we start the matching process. I think the ideal would be to never insert these values intopdp_contacts
to begin with, but I don't think this is very expensive.