Implemented text preprocessing through the Corpus and Document classes.
Implemented the ARTM method as described in this paper, with a few modifications, mainly in how tf-idf is calculated: my implementation is equivalent to scikit-learn's TfidfVectorizer.
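For reviewers, here is a minimal sketch of the weighting that TfidfVectorizer applies with its default settings (smooth_idf=True, sublinear_tf=False, norm="l2"), which is what "equivalent" is taken to mean here. The function name `tfidf_rows` and the pre-tokenized input are assumptions made for illustration; this is not the code added in this PR.

```python
import math
from collections import Counter


def tfidf_rows(docs):
    """Hypothetical helper: docs is a list of token lists.

    Returns (vocab, rows), where each row holds the L2-normalized
    tf-idf weights of one document, ordered like vocab.
    """
    vocab = sorted({token for doc in docs for token in doc})
    n_docs = len(docs)

    # Document frequency: number of documents containing each term.
    df = Counter(token for doc in docs for token in set(doc))

    # Smoothed idf, as in scikit-learn's default: ln((1 + n) / (1 + df)) + 1.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}

    rows = []
    for doc in docs:
        tf = Counter(doc)  # raw term counts, no sublinear scaling
        raw = [tf[t] * idf[t] for t in vocab]
        # L2-normalize each row; an all-zero row is left unchanged.
        norm = math.sqrt(sum(x * x for x in raw)) or 1.0
        rows.append([x / norm for x in raw])
    return vocab, rows
```

The smoothing adds one to both the document count and the document frequency, so a term that appears in every document still gets an idf of 1 rather than 0.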
Updated dependencies and added nltk
Added docs and examples
Closes #44