Closed MichaelAquilina closed 10 years ago
The stopword list used in nltk is rather short and does not filter out words like "also" which can increase the size of the index and does not really contribute any information. See 'stopwords.txt' as an example of a stopwords list to make use of.
The stopword list used in nltk is rather short and does not filter out words like "also" which can increase the size of the index and does not really contribute any information. See 'stopwords.txt' as an example of a stopwords list to make use of.