-
`pure_sklearn.map.convert_estimator` function crashes during a conversion of a sklearn unit having a functional variable (a CountVectorizer with a preprocessor parameter in my case).
`ValueError: O…
-
#### Description
when I use 'mae' criterion for the model extratreesregressor, training for a long time, it's seems lead to an endless training. there have no problem for mse
I find not…
-
When you say NLP functionality in the contribution guidelines are you hinting towards a julia implementation of tfidf_vectorizer and count_vectorizer from sklearn?
I'd be very willing to help out w…
-
I am having some troubles converting code from previous typescript client (v2) to new client (3.0.5).
My confusion revolves around:
- Will the collection definition be the same? I am not sure what c…
-
Hi, Thanks for this great package.
right now I use `KeyphraseCountVectorizer` method to extract keywords based on different POS patterns.
Here is my code:
```python
def kph_extr(docs:list, patt:s…
-
```
import jieba
def tokenize_zh(text):
words = jieba.lcut(text)
words = list(filter(lambda x: (len(x)>1), words))
return words
import numpy as np
from umap import UMAP
from skle…
-
Hello i am running BERTopic on a mabook pro m1 with the following parameters using precomputed embeddings with sentence transformer
```
vectorizer_model = CountVectorizer(stop_words="engli…
-
I've set the random seed when I fit my topic model, and I'm getting reproducible results. I'm using the following:
```
def fit_reduce_model(rep_model, docs):
"""
Defines all component mo…
-
The parameter in Text Vectorizer, which is `pad_to_max_tokens` has been changes from `True` to `False`.
## From The Video:
![image](https://user-images.githubusercontent.com/54589605/186830908-96…
-
We are now allowing both upsampling and downsampling in our workflow, so we need to update some file names in our current notebooks to reflect this.
- [x] Both `upsample.ipynb` and `downsample.ipy…