Closed ervivek closed 3 years ago
Same issue here I am getting a high score for documents when training using fast learn but getting low documents scores while using deep learn. And the documents in topics are also not related.
If you are using embedding_model='doc2vec'
then the quality of the document and word embedding will depend on the size and quality of your dataset. I would recommend trying to use embedding_model='universal-sentence-encoder'
I am working with amazon reviews data. Topic model created as follows -
model = Top2Vec(documents=df1, speed="learn", workers=2)
Sharing a topic generated by this model -
Following are the top 20 documents corresponding to the above topic. Keywords mentioned in the above topic not found in the documents list. Can you please suggest corrective actions?