-
### Have you searched existing issues? 🔎
- [X] I have searched and found no existing issues
### Desribe the bug
I am using beropic with llama3.1 for topic modelling. My text is long, so I use doc_…
-
Thank you for sharing it with community great tool and I would say it is UMAP+HDBSCAN on steroids!
Quick question though, when I try to cluster 30k of text embeddings, I am getting a lot of the tex…
-
Hi, Thanks again for your great tool,
I have a question regarding predefined Topics, whenver I add a list of **zeroshot_topic_list**, I got different generated topics and not the one I added, is th…
-
Hello Rosella team,
I notice that hdbscan also replies on python but actually there is a beautiful rust implementation (https://github.com/petabi/petal-clustering) and is paralleled when necessary.…
-
As required by [cuml's development guide](https://github.com/rapidsai/cuml/blob/61f85a6717d4d498ac5bb0815ca084c5f151fb00/wiki/python/DEVELOPER_GUIDE.md#creating-python-estimator-wrapper-class), estima…
-
Hi all,
The read_clustering step failed for whole operons (16S-ITS-23S, ~4.1k mean read length).
The whole nextflow pipeline ran through without errors (on a Mac M1) with test fastq files `mock4…
-
Hello,
I'm working with a very large dataset consisting of 7.5 million rows and 18 columns, which represents customer purchase behavior. I initially used UMAP for dimensionality reduction and attem…
-
### Have you searched existing issues? 🔎
- [X] I have searched and found no existing issues
### Desribe the bug
![newplot (3)](https://github.com/user-attachments/assets/2c46995c-ccce-4dd5-8816-…
-
Dear cuml team,
I am utilizing BERTopic for topic modeling. I understand that when I import UMAP from umap, and HDBSCAN from hdbscan, I can reproduce the results of topic modeling by setting random…
-
Hi!
If I try HDBSCAn clustering (on UMAP data) in 0.12.2 I have error message ("There was an unkown error.").
If I save the .ic file, open it in 0.12.1, then HDBSCAN runs perfect with same data, sam…