mimno / Mallet

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
https://mimno.github.io/Mallet/

Is --evaluator-filename not implemented in PolylingualTopicModel.java? #181

Closed i-kohey closed 4 years ago

i-kohey commented 4 years ago

I ran PolylingualTopicModel with the --evaluator-filename option, but no evaluator file was created. I checked PolylingualTopicModel.java and couldn't find any code that handles --evaluator-filename.

I want to use --evaluator-filename with PolylingualTopicModel. How do I run PolylingualTopicModel in Mallet so that this option writes the evaluator file?

mimno commented 4 years ago

Thanks for spotting this! The command option was there but it was never implemented. I've added code to save one evaluator per language. Note that these are monolingual, so you will be able to evaluate the log probability of each language individually, but you will not be able to evaluate log probability for a multi-lingual doc as a whole.
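
For readers who want a concrete picture of "one evaluator per language", here is a minimal, self-contained sketch of that idea. It is not the code that was added to Mallet. `LanguageEvaluator` is a hypothetical stand-in (a toy unigram scorer), and the `<prefix>.<languageIndex>` file naming is an assumption made for illustration only; check PolylingualTopicModel.java for the names Mallet actually writes.

```java
import java.io.*;
import java.util.*;

public class PerLanguageEvaluatorSketch {

    /** Hypothetical stand-in for a monolingual evaluator; NOT a Mallet class. */
    static class LanguageEvaluator implements Serializable {
        private static final long serialVersionUID = 1L;
        final int language;
        final Map<String, Double> wordLogProb; // toy unigram model for one language
        final double unseenLogProb;            // fallback for words not in the map

        LanguageEvaluator(int language, Map<String, Double> wordLogProb, double unseenLogProb) {
            this.language = language;
            this.wordLogProb = new HashMap<>(wordLogProb);
            this.unseenLogProb = unseenLogProb;
        }

        /** Log probability of the tokens of this one language only. */
        double logProbability(List<String> tokens) {
            double sum = 0.0;
            for (String token : tokens) {
                sum += wordLogProb.getOrDefault(token, unseenLogProb);
            }
            return sum;
        }
    }

    /** Assumed naming scheme for illustration: "<prefix>.<languageIndex>". */
    static File fileFor(String prefix, int language) {
        return new File(prefix + "." + language);
    }

    /** Serializes each evaluator to its own file, one file per language. */
    static void saveAll(String prefix, List<LanguageEvaluator> evaluators) {
        for (LanguageEvaluator evaluator : evaluators) {
            try (ObjectOutputStream out = new ObjectOutputStream(
                    new FileOutputStream(fileFor(prefix, evaluator.language)))) {
                out.writeObject(evaluator);
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
    }

    /** Reads back the evaluator that was saved for a single language. */
    static LanguageEvaluator load(String prefix, int language) {
        try (ObjectInputStream in = new ObjectInputStream(
                new FileInputStream(fileFor(prefix, language)))) {
            return (LanguageEvaluator) in.readObject();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String prefix = new File(System.getProperty("java.io.tmpdir"), "evaluator-sketch").getPath();
        saveAll(prefix, List.of(
            new LanguageEvaluator(0, Map.of("topic", Math.log(0.5), "model", Math.log(0.5)), Math.log(1e-6)),
            new LanguageEvaluator(1, Map.of("thema", Math.log(0.5), "modell", Math.log(0.5)), Math.log(1e-6))));

        // Each language of a document is scored separately; no joint score is produced.
        System.out.println(load(prefix, 0).logProbability(List.of("topic", "model")));
        System.out.println(load(prefix, 1).logProbability(List.of("thema")));
    }
}
```

The point of the sketch is the shape of the result: each saved file scores text in exactly one language, so a multilingual document yields one log probability per language rather than a single combined value.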

i-kohey commented 4 years ago

Thank you so much! I will try to use it.