srashtchi closed this issue 2 years ago
It seems the .get_representative_docs method returns an error when using GPU acceleration mode. With base BERTopic it returns sample docs. Any chance you could tell me, for the above example, how I can get the same functionality when using the GPU model?
The .get_representative_docs method only works with the CPU version of HDBSCAN. Whenever you pass a cluster model from a different package, the representative documents are not calculated. HDBSCAN does this in a specific way that does not generalize to all clustering algorithms. As for the GPU version of HDBSCAN, it is still under discussion what will and will not be supported in future versions of BERTopic, and which functionalities are actually possible to include at this stage of development of both packages.
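In the meantime, one cluster-agnostic workaround is to pick, for each topic, the documents whose embeddings lie closest to that topic's centroid. The sketch below is only an illustration of that idea and is not how BERTopic or HDBSCAN selects representative documents. The helper name `representative_docs_by_centroid` and its parameters are hypothetical, and it assumes you already have one topic label and one embedding vector per document.

```python
import math
from collections import defaultdict


def representative_docs_by_centroid(docs, topics, embeddings, top_n=3):
    """Return, per topic, the top_n documents closest to the topic centroid.

    Hypothetical helper, not part of BERTopic. Documents labelled -1
    (outliers) are skipped.
    """
    # Group document indices by their assigned topic label.
    members = defaultdict(list)
    for idx, topic in enumerate(topics):
        if topic != -1:
            members[topic].append(idx)

    def cosine(a, b):
        # Cosine similarity between two equal-length vectors.
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / norm if norm else 0.0

    result = {}
    for topic, idxs in members.items():
        dim = len(embeddings[idxs[0]])
        # Centroid = mean of the embeddings of all documents in the topic.
        centroid = [sum(embeddings[i][d] for i in idxs) / len(idxs) for d in range(dim)]
        # Rank the topic's documents by similarity to the centroid, best first.
        ranked = sorted(idxs, key=lambda i: cosine(embeddings[i], centroid), reverse=True)
        result[topic] = [docs[i] for i in ranked[:top_n]]
    return result
```

Here `topics` would be the labels returned by .fit_transform, and `embeddings` would be the document embeddings computed with the same embedding model, converted to plain lists. Because the ranking only needs labels and vectors, it should work regardless of which clustering backend produced the topics.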
Thanks Maarten for the clarification, it was really helpful.
Hi Maarten,
I have been trying to get the sample docs from a topic model. Below is the code up to the point where the model is fitted with .fit_transform. Then, when I try to see the representative docs with the .get_representative_docs method, it returns the error below. I then thought maybe I need to call ._save_representative_docs first, as below, which again returns an error.
Thanks in advance,
Shabnam