giacbrd / ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
GNU Lesser General Public License v3.0
198 stars 30 forks source link

word n-grams #3

Open giacbrd opened 7 years ago

giacbrd commented 7 years ago

Given https://github.com/giacbrd/ShallowLearn/issues/1, implementing n-grams of words

giacbrd commented 7 years ago

do not need to implement. show an example with gensim.models.Phrases, something like:

In this example document features are word unigrams and bigrams, limiting the feature space to XXX with feature hashing.