mimno / Mallet

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
https://mimno.github.io/Mallet/
Other
984 stars 344 forks source link

How to get the details of document categorize to each topic in topic modelling #177

Open rajesh8871 opened 4 years ago

rajesh8871 commented 4 years ago

Hi, Currently, I am trying to use the topic modelling for my data, but I am not been able to find any way to get all the result of document categorised under each topic.

mimno commented 4 years ago

The --output-doc-topics option will give you a spreadsheet view showing the estimated topic proportions of each document. The --output-topic-docs option will give you a file listing the most heavily weighted documents for each topic.