-
#### Describe the bug
Datatype specified in OneHotEncoder is not preserved when it is used with FeatureUnion.
#### Steps/Code to Reproduce
Please see this [gist](https://gist.github.com…
-
#### Description
`RandomUnderSampler` should permit undersampling a dataset containing NaNs. I don't see a reason this should be blocked by the def _check_X_y() function.
#### Steps/Code…
-
__UPDATE May 23 202__
Here's a list of the remaining classes:
- [ ] feature_selection.SelectorMixin
- [x] calibration.CalibratedClassifierCV ([#15134](https://github.com/scikit-learn/scikit-lea…
-
I am training a CountVectorizerModel in Spark, where it takes an array of strings as a input feature value. I then serialized the model using MLeap, and deserialize it in a pure Java environment and u…
-
Hello,
I think it is convenient to accept `list` as a value of `dict` in `DictVectorizer.fit()`
to accept multiple values for one categorical feature.
Assume that we use the category of movies as a …
-
**Rasa version**: 1.10.0
**Rasa SDK version**: 1.10.0
**Python version**: 3.6.5
**Operating system**: Windows 10
**Issue**: I have docker-compose file In that I have two container ras…
-
Right now when searching for specific estimators a page preview does not show properly. e.g.
Searching for `CountVectorizer scikit-learn` produces,
in DuckDuckgo
![Screenshot_2020-06-10 CountVe…
-
#### Describe the bug
When a SMOTE object resampling a Pandas DataFrame, the returned object from the .fit_resample() method should also be a Pandas DataFrame. Instead it is returned as a numpy array…
-
Hi,
I am currently doing experiments on a dataset classifying text document using Embedding, Conv1D and Dense layers.
```
from __future__ import print_function
import time
import warnings
…
-
Hi, i want to build a text clasffication model but i have gotten errors
```python
pipe = Pipeline([
('count_or_tf',PipelineHelper([
("count",CountVectorizer(tokenizer = spacy_tokenizer…