countvectorizer Search Results

1000+ results
for countvectorizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

MaartenGr/BERTopic #267

Memory issues with countvectorizer ngram(1,2) and not ngram(…

Hi, I have an issue when i try to fit_transform a list of 100,000 documents with countvectorizer , when I use an ngram(1,3) no memory error shows, but when I use ngram(1,2) i have this error : c:\…

doubianimehdi updated 3 years ago
7
MaartenGr/BERTopic #286

lemmatization in BERTopic

Hi there! One of the topics BERTopic extracted for me is ```2_printer_print_printing_printers```, and I was wondering, does BERTopic do some sort of lemmatization (I think that's what would help me…

MitraMitraMitra updated 3 years ago
1
UBC-MDS/canadian_heritage_funding #66

Feedback to work on

We need to pick at least four feedback according to which we can make improvements on our project. Let's discuss!

aimee0317 updated 2 years ago
7
MaartenGr/BERTopic #255

Lemmatization code snippet example

For those also searching the issues for lemmatization, this code seems to work ``` # Lemmatization from sklearn.feature_extraction.text import CountVectorizer import nltk nltk.download("punkt")…

bluepeter updated 3 years ago
2
MaartenGr/BERTopic #377

Stop words in corpus of large texts

1. Should stop words be removed from corpus beforehand? My topic_model generates clusters with most frequent words like "the", "and", "to" and etc. 2. Is there any model to process long text withou…

DKanarsky updated 2 years ago
3
UBC-MDS/canadian_heritage_funding #6

EDA discussion

Opening this issue to discuss which plots are necessary or what should be changed to show that our data is appropriate for our analysis/prediction. We should also discuss whether the quantiles used…

jo4356 updated 2 years ago
9
MaartenGr/BERTopic #341

Error when importing BERTopic

Can someone please let me know how can i get rid of this error. I tried installing torch==1.9.0 and torch==1.8.0 but none of them work. ImportError: cannot import name 'SAVE_STATE_WARNING' from 'to…

Besteverandever updated 2 years ago
10
MaartenGr/BERTopic #277

Error while trying to adjust hdbscan parameters to reduce fr…

Hi, I'm having this error while trying to minimize -1 topic by fiddling around hdbscan parameters ``` 101099it [17:03:36, 1.65it/s] 2021-10-10 09:25:15,138 - BERTopic - Transformed documents …

doubianimehdi updated 3 years ago
8
mitre-attack/tram #39

ImportError: DLL load failed: The specified module could not…

I closed the repository and created a virtualenv environment for it and did the pip install -r requirements. Now when starting the server on windows 10 pro, I get the following error: ``` Tracebac…

stenpiren updated 3 years ago
4
scikit-learn/scikit-learn #13056

FastICA whitening problem (Bug)

#### Description When performing FastICA using whiten=True attribute, the resulted unmixed signals have a variance of 1/len(data). this can be handled by multiplying the unmixed signals by …

hafezmg48 updated 3 years ago
10

上一页 1...80 81 82 83 84 85 86...100 下一页

1000+ results for countvectorizer

1000+ results
for countvectorizer