freezing / kaggle-product-search

Estimation of product relevance for the given search query.
0 stars 0 forks source link

Create stem dictionary from training and test set #19

Open freezing opened 8 years ago

freezing commented 8 years ago

Stem dictionary can be created using only words from the training and test data.

Some dummy algorithm could look like:

freezing commented 8 years ago

https://xapian.org/docs/stemming.html

freezing commented 8 years ago

This can help with plurals.

freezing commented 8 years ago

Very interesting for word similarity: http://orion.lcg.ufrj.br/Dr.Dobbs/books/book5/chap08.htm