maximtrp / bitermplus

Biterm Topic Model (BTM): modeling topics in short texts
https://bitermplus.readthedocs.io/en/stable/
MIT License
77 stars 13 forks source link

Implementation Guide #21

Closed neel6762 closed 2 years ago

neel6762 commented 2 years ago

I was wondering is there any way to print the the topics generate by the BTM model, just like how I can do it with Gensim. In addition to that, I am getting all negative coherence values in the range of -500 or -600. I am not sure if I am doing something wrong. The issues is, I am not able to interpret the results, even plotting gives some strange output.

image

The following image show what is held by the variable adobe, again I am not sure if it needs to be in this manner or each row here needs to a list

image
maximtrp commented 2 years ago

There is no function for such gensim-like output, but you can do it yourself using the vocabulary and words vs topics matrix. Semantic coherence is calculated as a sum of logs of fractions, so negative values are absolutely normal.

neel6762 commented 2 years ago

Thank you for your attention.