HaveIBeenPwned / EmailAddressExtractor

A project to rapidly extract all email addresses from any files in a given path
BSD 3-Clause "New" or "Revised" License
64 stars 23 forks source link

Handle email addresses encapsulated in double quotes #36

Closed troyhunt closed 1 year ago

troyhunt commented 1 year ago

Found another fairly major processing glitch there's now tests for in https://github.com/HaveIBeenPwned/EmailAddressExtractor/commit/84ee6089600eea82fd98ded17ec5501bec5ce22a. By running this app back to back with my oldie, I'm getting a good sense of how well the result sets align.

FWIW, I'm currently running it against this data set 😲 https://twitter.com/troyhunt/status/1651140792701059072

Lot of stuff like this:

a:6:{s:5:\"email\";s:23:\"test@example.com\";s:8:\"password\";s:8:\"an

hiteshbedre commented 1 year ago

Picking this up🚀

troyhunt commented 1 year ago

Perfect!