hdbscan-clustering-algorithm Search Results

300 results
for hdbscan-clustering-algorithm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

MaartenGr/BERTopic #826

Multilabel Supervised Learning

I'm training a supervised model where I have (for example) 100 documents and 20 topics. Some of the documents can have multiple topics assigned to them, moreover the documents cannot be split into sm…

roelvanderburg updated 1 year ago
1
MaartenGr/BERTopic #880

Predict documents with River's DBSTREAM

Hi Maarten, you really built a very cool library here. Your work is very appreciated. I am using Bertopic with River library's DBSTREAM as the cluster model. As online modeling isn't quite what I am…

MStefanPaulus updated 1 year ago
5
MaartenGr/BERTopic #151

help sought to train a big data sentence model (upto 1.5 mi…

Hey Maarten, Firstly thank you for all the help you have been uptill this point! 👍 👍 👍 I want to visualise the top topics using the same logic you so nicely showed here https://github.com/MaartenG…

schetudiante updated 1 year ago
47
mhahsler/dbscan #16

Document parameters better when benchmarking

You do not give details on the indexes chosen in the comparison methods. From the runtime numbers, it appears that you did not add enable an index in ELKI? Do the runtimes include JVM startup cost, R …

kno10 updated 1 year ago
10
MaartenGr/BERTopic #892

Possibility of re-assigning document to another topic after …

Hi Maarten, I just have couple of questions- 1. After we fit and transform the model, it produces the main topic assigned to each document. If we find that theres a document (e.g. document 1) wh…

aelb66 updated 1 year ago
4
MaartenGr/BERTopic #765

"get_representative_docs"

Hi Maarten I have been trying to get the sample docs from a topic model, below is the code up to the point where model is `.fit_transform`ed . ``` from sklearn.datasets import fetch_20newsgroup…

srashtchi updated 2 years ago
3
MaartenGr/BERTopic #700

top documents with probability of 1.0 for each topic

hi Maarten, I'm trying to train BERTopic with docs and extract top 30 documents with highest scores (descending order of doc_probs) for each topic as follows: `doc_topics, doc_probs = topic_mode…

yanfan0531 updated 2 years ago
2
MaartenGr/BERTopic #658

Hierarchical Visualization of the topics using HDBSCAN

Hello, Thank you for this fantastic work, Bertopic is really useful. I was wondering why is the visualization of the hierarchy based off the results of the c_tf_idf ? Since the HDBSCAN results is …

e-barrere updated 2 years ago
2
AlineTalhouk/diceR #158

extracting gene list from a specific cluster

I am having trouble in extracting the gene list which belong to a specific cluster. Is there a code or function which is already present to extract the genes?

aniketbroad2604 updated 2 years ago
1
MaartenGr/BERTopic #686

Doubt about the threshold to assing the documents to a speci…

Hi Maarten, more than a problem it is a doubt. What is the threshold of the probabilities to assign a register (a document) to a specific topic? That is, from what value in the probability of belo…

felipelopezp726 updated 2 years ago
2

上一页 1...18 19 20 21 22 23 24...30 下一页

300 results for hdbscan-clustering-algorithm

300 results
for hdbscan-clustering-algorithm