krayzpipes / txt-ferret

Identify and classify data in your text files with Python.
Apache License 2.0

SSNs not identified when scanned with this tool. #46

Open tMare-05 opened 3 weeks ago

tMare-05 commented 3 weeks ago

Valid SSNs are not identified or flagged when I scan files with this tool using any of the following regular expressions:

?!666|.+0{4})(?:\d{9}|\d{3}-\d{2}-\d{4}
(?!666|.+0{4})(?:\d{9}|\d{3}-\d{2}-\d{4})
\b(?!666|.+0{4})(?:\d{9}|\d{3}-\d{2}-\d{4})\b
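To separate regex behaviour from whatever the tool does afterwards, the third pattern can be tried on its own with Python's `re` module. This is only a sketch: the helper name `find_ssn` and the sample strings are made up for illustration, and it does not call txt-ferret at all. It shows one thing worth knowing about the pattern: the `.+0{4}` lookahead scans the rest of the line, so four consecutive zeros anywhere after the SSN on the same line suppress the match.

```python
import re

# Third pattern from the report, copied verbatim.
SSN_PATTERN = re.compile(r"\b(?!666|.+0{4})(?:\d{9}|\d{3}-\d{2}-\d{4})\b")


def find_ssn(line):
    """Return the first substring the pattern matches, or None.

    Hypothetical helper for this example; not part of txt-ferret.
    """
    match = SSN_PATTERN.search(line)
    return match.group(0) if match else None


# Made-up sample lines, not real SSNs.
samples = [
    "123-45-6789",            # dashed form
    "ssn: 123456789",         # nine plain digits
    "666-45-6789",            # rejected by the 666 lookahead
    "123-45-0000",            # rejected by the 0{4} lookahead
    "123-45-6789 ref 10000",  # rejected: 0000 appears later on the line
]

for sample in samples:
    print(repr(sample), "->", find_ssn(sample))
```

If the first two samples match here but the tool still reports nothing, the regex itself is probably not the problem and the cause is more likely in how the pattern is configured or in a validation step applied after matching.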

default.py doesn't have a default regular expression for SSNs, unlike the various credit card patterns, which validate successfully. Is there any documentation on how the Luhn algorithm or the code in sanity.py matches the data in text files against the regular expressions?
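For reference, the Luhn checksum itself is small enough to sketch here. This is a generic implementation of the standard algorithm, not the code in sanity.py, and the function name `luhn_valid` is an assumption made for this example. Credit card numbers carry a Luhn check digit, but SSNs do not, so if a Luhn step were applied to an SSN match, most valid SSNs would be rejected.

```python
def luhn_valid(number):
    """Return True if the digits in `number` pass the Luhn checksum.

    Generic sketch of the standard algorithm; not txt-ferret's own code.
    Non-digit characters such as spaces and dashes are ignored.
    """
    digits = [int(ch) for ch in number if ch.isdigit()]
    if not digits:
        return False

    total = 0
    # Walk from the rightmost digit; double every second digit.
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9  # same as summing the two digits of the product
        total += digit
    return total % 10 == 0


# A widely used credit card test number passes the checksum.
print(luhn_valid("4111 1111 1111 1111"))
# An SSN-shaped string has no check digit and usually fails.
print(luhn_valid("123-45-6789"))
```

Whether the tool runs such a check on SSN matches depends on its configuration, which this sketch does not inspect.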

tMare-05 commented 3 weeks ago

I am asking because the tool claims to validate SSNs along with credit card numbers.