jbenz / opendlp

Automatically exported from code.google.com/p/opendlp

Scanner not finding SSNs without spaces in Word 2010 .docx files? #121

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Save a valid SSN with no spaces into a Word 2010 .docx file (e.g. 5555555555)
2. Save the same SSN into a text file (5555555555)
3. Create a regex that should match an SSN with no spaces
4. Run a scan
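
For reference, a minimal sketch of the kind of pattern step 3 describes, written in Python for illustration only. The name `SSN_NO_SPACES` and the helper `find_ssn_candidates` are hypothetical and are not part of OpenDLP, and OpenDLP's own regex engine may behave differently. Note that the example value in step 1 has ten digits, so a strict nine-digit pattern like this one would not match it.

```python
import re

# Hypothetical pattern: exactly nine digits that are not part of a longer digit run.
# This only checks the shape of the value, not whether the SSN is actually valid.
SSN_NO_SPACES = re.compile(r"(?<!\d)\d{9}(?!\d)")


def find_ssn_candidates(text: str) -> list[str]:
    """Return every nine-digit run in text that could be an unformatted SSN."""
    return SSN_NO_SPACES.findall(text)
```

A looser pattern such as `\d{9}` without the lookarounds would also match inside longer digit strings, which trades missed hits for more false positives.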

What is the expected output? What do you see instead?
Expected: the scan flags the SSN in every file. Instead, it misses the 
no-spaces SSN in the .docx files.

Using MS Word 2010, I created three files, each containing a valid SSN in a 
different format: dashes, spaces, and no spaces.  I tried every regex I could 
find that is supposed to match SSNs with no spaces, but none of them ever 
worked.  If I saved the files as .txt, the scan always caught them.  
Could it be something about the .docx format?
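
One possible explanation, offered as an assumption rather than a confirmed diagnosis: a .docx file is a ZIP container whose body text lives in `word/document.xml`, usually compressed, and Word can split a single string across several text runs. A scanner that matches against the raw file bytes would generally not see the digits at all. The sketch below shows one way to pull the text out before matching. The names `docx_text` and `scan_docx` are hypothetical, and nothing in this report confirms how OpenDLP reads .docx files.

```python
import io
import re
import zipfile

# Hypothetical pattern: nine digits that are not part of a longer digit run.
SSN_NO_SPACES = re.compile(r"(?<!\d)\d{9}(?!\d)")
PARAGRAPH_END = re.compile(r"</w:p>")
TAG = re.compile(r"<[^>]+>")


def docx_text(data: bytes) -> str:
    """Return the body text of a .docx file given its raw bytes."""
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        xml = archive.read("word/document.xml").decode("utf-8")
    # Keep paragraphs on separate lines, then drop all remaining tags so
    # that text split across several runs is joined back together.
    xml = PARAGRAPH_END.sub("\n", xml)
    return TAG.sub("", xml)


def scan_docx(data: bytes) -> list[str]:
    """Return SSN-shaped matches found in the extracted document text."""
    return SSN_NO_SPACES.findall(docx_text(data))
```

The tag stripping here is deliberately crude and ignores headers, footers, and XML entities; a real extractor would use an XML parser.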

What version of the product are you using? On what operating system?
The newest OpenDLP VM downloaded from the project site.

Please provide any additional information below.

Original issue reported on code.google.com by geekfe...@gmail.com on 21 Aug 2014 at 5:55