countvectorizer Search Results

1000+ results
for countvectorizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

MaartenGr/BERTopic #324

Linux kills process during fit_transform (memory error?)

First and foremost, thanks for a magnificent package! I'm fitting ~1.6 million tweets and embeddings (which I have pre-calculated; size ~9.6GB) using following the [FAQ on memory issues](https://ma…

Rysias updated 2 years ago
7
rapidsai/cuml #4219

[FEA] Supporting `get_feature_names` for `TfidfVectorizer`

**Is your feature request related to a problem? Please describe.** I'm looking to get similar functionality from [TfidfVectorizer ](https://docs.rapids.ai/api/cuml/stable/api.html#cuml.feature_extra…

mayankanand007 updated 2 years ago
2
UBC-MDS/DSCI_522_Spotify_Track_Popularity_Predictor #10

Feature Transformations Discussion

These are the column types that I have identified along with transformations for each column type. Does everyone agree with these transformations? #CountVectorizer text_features = "song" …

jessie14 updated 2 years ago
2
scikit-learn/scikit-learn #21242

CountVectorizer.transform() much slower since version 1.0

### Describe the bug Since version 1.0, calling `CountVectorizer.transform()` is more than 100 times slower compared to previous versions. I did some basic profiling and I think it is related to the …

sobayed updated 3 years ago
1
milvus-io/web-content #654

[Suggestion] Update the Milvus bootcamp page.

> Note: This repository is ONLY used to solve issues related to DOCS. > For other issues, please move to [other repositories](https://github.com/milvus-io/). **Is there anything that's missing or …

YiyunNi updated 3 years ago
1
scikit-learn/scikit-learn #12592

It seems that the constant_features attribute of class Split…

#### Description The constant_features attribute is created and malloced, save the constant features' index, but no code uses it, it just save. The tree use n_constant_features to get const…

ruiann updated 2 years ago
6
MaartenGr/BERTopic #423

Most documents assigned to -1 topic

Dear Maarten, many thanks for this great module, we are exploring it currently in our [research project](https://essl.leeds.ac.uk/politics/dir-record/research-projects/1178/understanding-normative-ch…

ViktoriaSpaiser updated 2 years ago
11
gregversteeg/corex_topic #24

How we can testing the model on new data ?

Hello, thank you for this tutoriel, i want to build a anchored model for text classification (i have 5 classes) sentences, so i trained an anchored model with 5 topic, but how can i test the model on…

Suhaib441 updated 2 years ago
11
MaartenGr/BERTopic #385

reduce_topics doesn't use stopwords?

I created a model and saved it, restored it, reduced the topics ``` vectorizer_model = CountVectorizer(ngram_range=(1, 3), stop_words="english") AllModel = BERTopic(vectorizer_model=vectorizer_mo…

drob-xx updated 2 years ago
1
scikit-learn/scikit-learn #7351

Can not use splitter.sample_weight in _add_split_node()

#### Description I want to use the splitter.sample_weight[i] in _add_split_node, but I got the segement fault error. I developed a new split criterion, the sample_weight can be positive or nega…

zjnsteven updated 2 years ago
3

上一页 1...78 79 80 81 82 83 84...100 下一页

1000+ results for countvectorizer

1000+ results
for countvectorizer