GateNLP / gateplugin-LearningFramework

A plugin for the GATE language technology framework for training and using machine learning models. Currently supports Mallet (MaxEnt, NaiveBayes, CRF and others), LibSVM, Scikit-Learn, Weka, and DNNs through Pytorch and Keras.
https://gatenlp.github.io/gateplugin-LearningFramework/
GNU Lesser General Public License v2.1
26 stars 6 forks source link

Add support for topic models by wrapping gensim #71

Open johann-petrak opened 6 years ago

johann-petrak commented 6 years ago

This will need an even simpler "corpus representation" for text (list of tokens) only.

johann-petrak commented 6 years ago

A quicker way and easier to use may be to use Mallet's LDA module. This requires a different Mallet Corpus Representation (based on FeatureSequences and no targets) and may not scale as well as using an OOM representation with a GenSim backend, but would not need any additional software installed and automatically port to Windows.