-
I've applied the latest Annif yso-fi model to a large corpus of Finnish thesis abstracts. I noticed that the term "Määri" (https://finto.fi/yso-paikat/fi/page/p124541), which is a Czech province, comes up s…
-
#6372 adds `get_feature_names` to `PolynomialFeatures`. It accepts a list of names of `input_features` (or substitutes with defaults) and constructs feature name strings that are human-readable and in…
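
To illustrate the idea, here is a minimal pure-Python sketch of how readable names for polynomial terms could be built from input feature names. It is not the code from the PR: the helper `polynomial_feature_names`, its parameters, the `x0, x1, ...` defaults, and the `a^2` / `a b` name format are all assumptions made for this example.

```python
from itertools import combinations_with_replacement


def polynomial_feature_names(n_features, degree, input_features=None, include_bias=True):
    """Build readable names for all polynomial terms up to `degree`.

    Hypothetical helper for illustration only; the naming scheme is an assumption.
    """
    if input_features is None:
        # Assumed default names when the caller supplies none.
        input_features = [f"x{i}" for i in range(n_features)]
    if len(input_features) != n_features:
        raise ValueError("input_features must have one name per input feature")

    names = []
    start = 0 if include_bias else 1
    for d in range(start, degree + 1):
        # Each combination of feature indices (with repetition) is one term.
        for combo in combinations_with_replacement(range(n_features), d):
            if not combo:
                names.append("1")  # the bias (degree-0) term
                continue
            parts = []
            for idx in sorted(set(combo)):
                power = combo.count(idx)
                name = input_features[idx]
                parts.append(name if power == 1 else f"{name}^{power}")
            names.append(" ".join(parts))
    return names


print(polynomial_feature_names(2, 2, ["a", "b"]))
```

Terms are grouped by total degree, so the bias term comes first, then the linear terms, then the quadratic ones.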
-
## To do
- Try creating vectors with other pretrained models
- twitter
- google
-
First we should take a look into the data we have by analysing keywords and using tf-idf.
- [x] Determine the top 20 words (unigrams) per conference
- [x] Determine the top 20 bigrams per conference…
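
The unigram and bigram counts in the checklist could be sketched roughly as follows. This is only an illustration using the standard library: it ranks by raw frequency rather than tf-idf, and the helper names, the tokenizer, and the sample records are all hypothetical.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (a simplifying assumption)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def top_ngrams_per_group(records, n=1, k=20):
    """Return the k most frequent n-grams for each group.

    `records` is an iterable of (group, text) pairs, e.g. (conference, title).
    """
    counts = defaultdict(Counter)
    for group, text in records:
        tokens = tokenize(text)
        # Zip n shifted copies of the token list to form consecutive n-grams.
        ngrams = zip(*(tokens[i:] for i in range(n)))
        counts[group].update(" ".join(gram) for gram in ngrams)
    return {group: counter.most_common(k) for group, counter in counts.items()}


# Hypothetical sample data, for illustration only.
records = [
    ("ConfA", "Deep learning for deep learning models"),
    ("ConfB", "Graph theory and graph algorithms"),
]
print(top_ngrams_per_group(records, n=1, k=20))
print(top_ngrams_per_group(records, n=2, k=20))
```

Swapping the raw counts for tf-idf weights would down-weight words that are common across all conferences, which is closer to what the task describes.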
-
Hi. I tried running BERTopic on Google Colab cloud GPUs. The embedding is blazingly fast, compared to what I have been getting on my CPU server.
Unfortunately, at the very end of the .fit_transform…
-
#### Describe the bug
#### Steps/Code to Reproduce
```
from sklearn.neighbors._base import UnsupervisedMixin, SupervisedFloatMixin
```
#### Expected Results
Be able to import Uns…
-
Many thanks to this project, which provides much help in text analysis, especially for social science researchers less familiar with code.
However, I have encountered a problem with BERTopic that h…
-
> Difficulty: ★★☆☆☆
## Background
`Dataset.Tabular` and `Dataset.Image` both have the methods `to_numpy()` and `to_pandas()`.
`Dataset.Image` is read into memory using the Pillow library. Imag…
-
When checking the Topics over time graph the first cluster is missing.
The displayed clusters are `[-1, 1, ..., N]`.
Cluster "0" is missing but it exists when printing `topic_model.get_topic_info…
-
```
---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
in
----> 1 from bertopic import BERTop…