giacbrd / ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
GNU Lesser General Public License v3.0
198 stars 30 forks source link

somehow combine fastText with doc2vec #4

Open giacbrd opened 8 years ago

giacbrd commented 8 years ago

fastText supervised model does not take into account of the document and words representation, it just learns bag of words and labels. embeddings are computed only on the relation word->label. it would be interesting to learn jointly the semantic relation label<->document<->word<->context. for now it is only possible to pre-train word embeddings and then use them as initial vectors for the classification algorithm

giacbrd commented 7 years ago

see https://github.com/Skarface-/fastText_doc2vec