Closed bbowler86 closed 7 years ago
I solved my own problem, I just changed the second to last line from:
X = title_rf_vectorizer.fit_transform(feature_vector["Classification"]).toarray()
to
X = title_rf_vectorizer.fit_transform(feature_vector["FeatureVector"]).toarray()
Sorry about that. You can consider this issue closed.
Hey, great tool.
I have a problem though when I am trying to use a TfidfVectorizer for Text Classification. When I create a Single Base Learner I get the error:
ValueError: all the input array dimensions except for the concatenation axis must match exactly
.The type of the
X
variable is annumpy.ndarray
, but if I don't convert the variableX
to an array then I get the error message:TypeError: Singleton array array(<92820x194 sparse matrix of type '<class 'numpy.float64'>' with 92820 stored elements in Compressed Sparse Row format>, dtype=object) cannot be considered a valid collection.
I choose the preset learner setting
scikit-learn Random Forest
as a Base Learner Type.