'Merge_Models' with new topic_model from outliers

MaartenGr / BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

MIT License

6.19k stars 765 forks source link

When merging the Topic_model (including all data, with outliers) and the Out_Topic_model (consisting only of outliers), the 'Count' of the Topic_model for -1 increases by the number of outliers, instead of effectively concat them.

I have a hard time understanding what you exactly mean here. Could you give an example? Perhaps showcase what is happening and what you would expect to happen?

The Representative_docs are displayed as NaN. => is the only way?

The representative documents are indeed displayed as NaN since merge_models is also meant for federated learning. If you want representative documents re-calculated, I would advise checking the issues page. I believe there are a number of issues that describe in detail how you can do this.

MaartenGr / BERTopic

'Merge_Models' with new topic_model from outliers #2222