dask / dask-ml

Scalable Machine Learning with Dask
http://ml.dask.org
BSD 3-Clause "New" or "Revised" License
890 stars 255 forks source link

Example: Mixed datatypes with ColumnTransformer #298

Open mrocklin opened 6 years ago

mrocklin commented 6 years ago

It would be useful to include an educational example of how to handle data with mixed datatypes, including both text and numerical data. This would possibly force some development on ColumnTransformer, ensuring that that gets built out and that the API is reasonable.

This came out of conversation with @ogrisel

TomAugspurger commented 6 years ago

I'm looking at this today.