A plugin for the GATE language technology framework for training and using machine learning models. It currently supports Mallet (MaxEnt, NaiveBayes, CRF and others), LibSVM, scikit-learn, Weka, and DNNs through PyTorch and Keras.
Train Topic Model: ideally this would also use the example pipelines from stringannotation (token filtering by stopwords) and corpusstats (token filtering by TF-IDF), but it is not yet clear how to combine them.
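As a rough illustration of the two filtering steps mentioned above, independent of GATE and of the stringannotation and corpusstats pipelines, a minimal Python sketch might look like the following. The function name `filter_tokens`, the `min_tfidf` threshold, and the TF-IDF variant used (relative term frequency times the natural log of N/df) are all assumptions made for this example, not anything defined by those plugins.

```python
import math
from collections import Counter


def filter_tokens(docs, stopwords, min_tfidf):
    """Hypothetical helper: drop stopwords, then drop low TF-IDF tokens.

    docs is a list of token lists, stopwords is a set of lowercase strings,
    and min_tfidf is an assumed per-document score threshold.
    """
    # Step 1: remove stopwords (case-insensitive match).
    docs = [[t for t in doc if t.lower() not in stopwords] for doc in docs]

    # Step 2: document frequency, i.e. how many documents contain each token.
    n_docs = len(docs)
    df = Counter(t for doc in docs for t in set(doc))

    filtered = []
    for doc in docs:
        tf = Counter(doc)
        # Keep a token only if tf * idf reaches the threshold in this document.
        kept = [
            t for t in doc
            if (tf[t] / len(doc)) * math.log(n_docs / df[t]) >= min_tfidf
        ]
        filtered.append(kept)
    return filtered


if __name__ == "__main__":
    corpus = [["the", "cat", "sat"], ["the", "dog", "sat"], ["the", "cat", "ran"]]
    print(filter_tokens(corpus, {"the"}, 0.3))
```

The filtered token lists would then be what a topic model trainer sees; how such filtering should be wired into the actual GATE pipelines is the open question noted above.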
Initially: