-
Hi. I tried running BERTopic on Google Colab cloud GPUs. The embedding is blazingly fast, compared to what I have been getting on my CPU server.
Unfortunately, at the very end of the .fit_transform…
-
Many thanks to this project, which provides much help in text analysis, especially for social science researchers less familiar with code.
However, I have encountered a problem with BERTopic that h…
-
When checking the Topics over time graph the first cluster is missing.
The displayed clusters are ` [-1, 1, ..., N ]` .
Cluster "0" is missing but it exists when printing `topic_model.get_topic_info…
zynos updated
3 years ago
-
#### Describe the bug
#### Steps/Code to Reproduce
```
from sklearn.neighbors._base import UnsupervisedMixin, SupervisedFloatMixin
```
#### Expected Results
Be able to import Uns…
-
> Difficulty: ★★☆☆☆
## Background
`Dataset.Tabular` and `Dataset.Image` both have the methods `to_numpy()` and `to_pandas()`.
`Dataset.Image` is read into memory using the Pillow library. Imag…
-
#### Describe the issue linked to the documentation
I was looking for a defition of "raw text documents" but could not find it in the documentation. Being unable to feed strings and lists of string…
-
#### Describe the bug
When using the pairwise_distances to compute euclidean distance, I noticed that the output matrix has 0 values outside the main diagonal. I examined the output to understa…
-
```
---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
in
----> 1 from bertopic import BERTop…
-
**Link to the notebook**
https://github.com/aws/amazon-sagemaker-examples/blob/master/introduction_to_applying_machine_learning/ntm_20newsgroups_topic_modeling/ntm_20newsgroups_topic_model.ipynb
…
-
Hello i want to explain multiclass text classification
I use sklearn CountVectorizer and MultinomialNB
How i can do it
```
cv = CountVectorizer(stopwords)
nb = MultinomialNB(alpha=.01)
```