R007Wh04mi / theharvester

Automatically exported from code.google.com/p/theharvester
GNU General Public License v2.0
1 stars 0 forks source link

Case sensitive regex #9

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
The regexs in myparser.py arse case sensitive, I think it makes sense to make 
them case insensitive to yield more results.

To do this for emails change like 34 in myparser.py to:

reg_emails = re.compile('[a-zA-Z0-9.-_]*' + '@' + '[a-zA-Z0-9.-]*' + self.word, 
re.I)

^ re.I will also have to be added to the other regexs within myparser.py to 
make them case insensitive.

Original issue reported on code.google.com by ryandewh...@gmail.com on 6 Mar 2013 at 11:39

GoogleCodeExporter commented 9 years ago
Would probably want to change line 35 to lowercase the results before they are 
uniqued also:

self.temp = reg_emails.findall(self.results.lower())

Original comment by ryandewh...@gmail.com on 6 Mar 2013 at 11:51