maximtrp / bitermplus

Biterm Topic Model (BTM): modeling topics in short texts
https://bitermplus.readthedocs.io/en/stable/
MIT License
77 stars 13 forks source link

The vocabularies input into BTM #25

Closed Chen-X666 closed 2 years ago

Chen-X666 commented 2 years ago

Hi, @maximtrp, I am trying to use bitermplus for topic modeling. However, i find the vocubulary input to BTM will filter the single word. At the angle of english, such is a good approach to filter the meaningless vocubularies, but at the angle of other language, like chinese, some single vacubularies are meaning for semantic understanding, did u provide an interface to close such filter mechanism. I appreciate if you advise for that.

maximtrp commented 2 years ago

Hello! Could you please provide a minimal example with this word?