nlesc-sherlock / analyzing-corpora

Using NLP to analyse large collections of documents.
Apache License 2.0
0 stars 0 forks source link

Replace pre-processing (low priority) #12

Open c-martinez opened 8 years ago

c-martinez commented 8 years ago

Replace language-specific pre-processing for NLTK. We need this to be able to use documents translated (see #11) to Dutch.

This has low priority since it is more important to get the LDA running more efficiently