countvectorizer Search Results

1000+ results
for countvectorizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

MaartenGr/BERTopic #1711

ctfidf breaks down when specifying a vocabulary in CountVect…

In some cases, the stop_words parameter of the CountVectorizer is not enough to prevent certain non-desired words from coming through. For example, one may have the desire to filter out non-verbs like…

dannywhuang updated 5 months ago
2
campusx-official/movie-recommender-system-tmdb-dataset #6

ValueError: source code string cannot contain null bytes

Getting error while vectorizing the string i use -> ``` from sklearn.feature_extraction.text import CountVectorizer cv = CountVectorizer(max_features=5000,stop_words='english') ``` error I go…

Nikhilsinghbora updated 8 months ago
1
EyeofBeholder-NLeSC/orange3-argument #91

Error in running example.ipynb

After running the following code: # Compute topics of chunks chunk_topics, chunk_embeds, df_topics = chunker.get_chunk_topic(chunks=chunks) I got this error: InvalidParameterError: The 'ngram_…

atefekeshavarzi updated 7 months ago
10
StatguyUser/TextFeatureSelection #30

'CountVectorizer' object has no attribute 'get_feature_names…

`from TextFeatureSelection import TextFeatureSelection #Binary classification input_doc_list=new_df_4['txt'].values.tolist() target=new_df_4['target'].values.tolist() fsOBJ=TextFeatureSelection(ta…

primadermawan updated 7 months ago
4
MaartenGr/BERTopic #1817

Different number of topics for different training runs on th…

Hi, I am facing issue. If I train bertopic on a same dataset multiple times, I am getting different number of topics . As per the discussion in this thread: https://github.com/MaartenGr/BERTop…

abdullahfurquan updated 4 months ago
13
Zeta-and-Company/pydistinto #7

AttributeError: 'CountVectorizer' object has no attribute 'g…

When running run_pydistinto_beginners.py, I get another error saying that the attribute "get_feature_names" of "CountVectorizer" is not found. I have Python3.10.6 on Ubuntu 22.04.2 LTS ![grafik](http…

hennyu updated 1 year ago
2
onnx/sklearn-onnx #446

Add converter for CountVectorizer with "char_wb" analyzer

I've tried but this error occurred, `NotImplementedError: CountVectorizer cannot be converted, only tokenizer='word' is supported. You may raise an issue at https://github.com/onnx/sklearn-onnx/is…

cppntn updated 3 years ago
1
MaartenGr/BERTopic #1998

Which hyper parameter mostly influence the number of topics …

``` import jieba def tokenize_zh(text): words = jieba.lcut(text) words = list(filter(lambda x: (len(x)>1), words)) return words import numpy as np from umap import UMAP from skle…

fishfree updated 1 month ago
3
scikit-learn/scikit-learn #15588

CountVectorizer integration with ColumnTransformer is unintu…

Here is an issue that I came across with Count Vectorizer and its use with Column Transformer and Pipelines https://stackoverflow.com/questions/54541490/sklearn-text-and-numeric-features-with-colum…

beachkrp updated 2 years ago
1
scikit-learn/scikit-learn #14559

CountVectorizer sets self.vocabulary_ in transform

Right now CountVectorizer sometimes sets ``self.vocabulary_`` outside of ``fit``. We usually prohibit this, but the common tests haven't reached the vectorizers yet.

amueller updated 2 years ago
8

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for countvectorizer

1000+ results
for countvectorizer