freezing / kaggle-product-search

Estimation of product relevance for the given search query.
0 stars 0 forks source link

Remove STOP words #22

Open freezing opened 8 years ago

freezing commented 8 years ago

http://www.lextek.com/manuals/onix/stopwords1.html remove them using dictionary, but be careful of words that are representing model number: e.g. R 19. Don't remove R.

or A-12

Remove stop words such as: for, is, when, in, on, the. This should be done in lexer or even before.