countvectorizer Search Results

1000+ results
for countvectorizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NorskRegnesentral/skweak #18

Simple example of full document classification and questions

First, thanks for this great tool. I'm trying to learn skweak for full document classification. I found your sentiment example ("weak_supervision_sentiment.py") bit too complicated and slow (because o…

kauttoj updated 3 years ago
2
scikit-learn/scikit-learn #20128

Strange results when using list of dicts as parameters in Ra…

#### Describe the bug Unexpected results when using a list of dictionaries with RandomSearchCV. Hard to describe, please just look at the output below. #### Steps/Code to Reproduce ```pyt…

sprusaka updated 2 years ago
1
MaartenGr/KeyBERT #67

how to filter keywords by the part of speech?

Hi @MaartenGr , I found the result of keywords in Chinese most are adjectives。 Can i filter it by the part of speech？ Another thing there are some number and punctuation in the result。

xesgue updated 3 years ago
4
MaartenGr/BERTopic #331

Is there a way to retrieve the words used to generate the tf…

Hey, I saw this issue and I wanted to get the P(word|topic) https://github.com/MaartenGr/BERTopic/issues/144 You suggested accessing it using `model.c_tf_idf`, but I still need the words that were…

sgdantas updated 2 years ago
4
scikit-learn/scikit-learn #20516

MLPClassifier and MLPRegressor returning different weights w…

#### Describe the bug MLPClassifier( ) is returning different weights when trained on Linux versus windows with same class instantiation parameters and same data. parameters:- activation='relu',…

jkquant updated 3 years ago
1
hirowatari-s/ExploreSearchSystem #6

データ加工

TetraMiyazaki updated 3 years ago
6
MaartenGr/BERTopic #302

Max number of docs/sentences

Hi! I am experimenting with a paragraph/sentence-based approach of implementing BERTopic. I have a corpus that contains 4158 docs. I've split it into sentences (get 633003 sentences) and tried to …

bohdanbaliuk2020 updated 3 years ago
4
marcotcr/anchor #74

Is it possible to use AnchorText with Tokenizer instead of C…

Good afternoon. Thank you for such a great package! Is it possible to implement AnchorsText explainer with a model which takes in Tokenizer.texts_to_sequences data? **My current implementation:*…

Enantiodromis updated 3 years ago
2
jonathandunn/text_analytics #6

edX assessment 3: fit_tfidf() fails with memory error

In assessment 3 of the edX course, fit_tfidf() failed with a memory error. There's a report from another student in the edX discussion, so I am not the only one having this problem. ``` import os…

qhiGenrRvta2 updated 3 years ago
1
scikit-learn/scikit-learn #9213

In linear_model.LinearRegression: ValueError: array must not…

#### Description I run the **linear_model.LinearRegression()** function. I have filtered the nan and inf value by `numpy.nan_to_num(X)`. #### Steps/Code to Reproduce ``` .…

linrio updated 3 years ago
1

上一页 1...81 82 83 84 85 86 87...100 下一页

1000+ results for countvectorizer

1000+ results
for countvectorizer