MichaelAquilina / SpamFilter

Classification of emails using machine learning and natural language processing techniques in Java
5 stars 4 forks source link

Collapse numbers in the inverted index #2

Closed MichaelAquilina closed 10 years ago

MichaelAquilina commented 10 years ago

It may be useful to collapse numbers based on their format:

MichaelAquilina commented 10 years ago

https://gist.github.com/KillaW0lf04/d430834e07b4e7aa3901