-
Hi,
When I run BERTopic using `model.fit_transform` on my dataset, it will return cluster numbers and titles that make sense for the input data. When I run the same string through the trained model…
-
Hello,
is it possible to run HDBSCAN on a graph without having to convert the graph into a sparse matrix first ?
Currently we convert the graph into a sparse matrix and use the "precomputed" metri…
-
Hi,
I have a corpus containing around 0.5 million English sentences.
I would like to perform clustering, which should automatically group similar sentences, and return number of groups along wi…
-
Hi,
when I run HDBSCAN I see that only one CPU core is used at 100%.
Would it be possible to implement HDBSCAN in a way that it uses multiple cores?
Thanks
Philip
-
-
I was trying to cluster on a dataset with about 460,000 points and ran into a memory error when `.fit` was called.
This was the pertinent code:
```Python
eps = .05 / 6378
clusterer = (hdbsca…
-
I completed the topic modeling as the tutorial recommend, but only find the topic seems far from proportional.
You see the first topic account for 70+% but the other topics account for very little i…
-
Hi, Thank you so much for creating such a cool and interactive and highly modular technique in Topic Modeling.
I have a question and a recommendation.
Regarding the question, I'm curious if ther…
-
Thanks for sharing your interesting work.
I would recommend using [LargeVis](https://github.com/lferry007/LargeVis) instead of t-SNE to get the low-dimensionality representation.
-
### BERTopic in Weaviate for Cluster Analysis
### What
Large text datasets tend to be composed of many different topics. For example, Wikipedia contains text about animals, sports, medicine, and man…