kmpoon / hlta

Provides functions for hierarchical latent tree analysis on text data for hierarchical topic detection
GNU General Public License v3.0
81 stars 23 forks source link

How to fit model to unseen document? #13

Closed Benja1972 closed 4 years ago

Benja1972 commented 4 years ago

In LDA methods after creation of topics model we can fit unseen document and get topics distribution on it. I wonder if it is possible with HLTA model? How to annotate unseen document by topics already created by HLTA model?

Thank you

kmpoon commented 4 years ago

You may use the Doc2Vec Assignment to do so. You will need to convert the unseen document to a binary vector as the input data. Each element in the vector corresponds to a word (in the same order) in the vocabulary you used to train the model.

Benja1972 commented 4 years ago

Thank you! Very informative, I will try