giacbrd / ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
GNU Lesser General Public License v3.0
198 stars 30 forks source link

pre-train internally #9

Closed giacbrd closed 7 years ago

giacbrd commented 7 years ago

Given a training set, it is possible to train the word embeddings (unsupervised) first, and then doing the label learning. It is similar to pass a pre-trained model, but more transparent

giacbrd commented 7 years ago

added fit_embeddings(X) to GensimFastText in https://github.com/giacbrd/ShallowLearn/commit/09c25aae14a308bc817941d4e85044634118693e