Open DeanSherwin opened 6 years ago
Just putting this here so it doesn't get forgotten. We should look into the process of 'Stemming' which remove prefixes and suffixes from words eg. 'ing'. 'ed' etc....
Whoosh search engine does this
stemming algorithms such as Porter and Porter2, Paice Husk, and Lovins.
These are available in the Whoosh source code.
Lemmatization is a similar, less forceful process.
Just putting this here so it doesn't get forgotten. We should look into the process of 'Stemming' which remove prefixes and suffixes from words eg. 'ing'. 'ed' etc....