solvvy / redact-pii

Remove personally identifiable information from text.
MIT License
189 stars 57 forks source link

regex are not working very good #39

Open danmihaila opened 4 years ago

danmihaila commented 4 years ago

We are trying to use the redact-pii which is working good, but when doing some checks on files I am finding some strange results: 1. original data: _yes,hiraan,belet_weyne,no,RE-HRN-BTW-13,male,age41_59,eldery_male,elderymale,14,no,0.250750683890174 data after redact-pii applied: _yes,hiraan,belet_weyne,no,RE-HRN-BTW-13,male,age41_59,eldery_male,eldery_male,14,no,0.CREDIT_CARDNUMBER

removing the credit card number rule, it founds something else: 2. original data: _yes,hiraan,belet_weyne,no,RE-HRN-BTW-13,male,age41_59,eldery_male,elderymale,14,no,0.250750683890174 data after redact-pii applied: _yes,hiraan,belet_weyne,no,RE-HRN-BTW-13,male,age41_59,eldery_male,eldery_male,14,no,PHONENUMBER90174

Any hints?