-
First, thanks for this great tool. I'm trying to learn skweak for full document classification. I found your sentiment example ("weak_supervision_sentiment.py") bit too complicated and slow (because o…
-
#### Describe the bug
Unexpected results when using a list of dictionaries with RandomSearchCV. Hard to describe, please just look at the output below.
#### Steps/Code to Reproduce
```pyt…
-
Hi @MaartenGr ,
I found the result of keywords in Chinese most are adjectives。 Can i filter it by the part of speech?
Another thing there are some number and punctuation in the result。
-
Hey, I saw this issue and I wanted to get the P(word|topic)
https://github.com/MaartenGr/BERTopic/issues/144
You suggested accessing it using `model.c_tf_idf`, but I still need the words that were…
-
#### Describe the bug
MLPClassifier( ) is returning different weights when trained on Linux versus windows with same class instantiation parameters and same data.
parameters:- activation='relu',…
-
-
Hi!
I am experimenting with a paragraph/sentence-based approach of implementing BERTopic.
I have a corpus that contains 4158 docs. I've split it into sentences (get 633003 sentences) and tried to …
-
Good afternoon. Thank you for such a great package!
Is it possible to implement AnchorsText explainer with a model which takes in Tokenizer.texts_to_sequences data?
**My current implementation:*…
-
In assessment 3 of the edX course, fit_tfidf() failed with a memory error. There's a report from another student in the edX discussion, so I am not the only one having this problem.
```
import os…
-
#### Description
I run the **linear_model.LinearRegression()** function. I have filtered the nan and inf value by `numpy.nan_to_num(X)`.
#### Steps/Code to Reproduce
```
.…