coiled / data-science-at-scale

A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).
MIT License
112 stars 38 forks source link

Small datasets on binder #8

Open hugobowne opened 4 years ago

hugobowne commented 4 years ago

If learners use binder and thus need smaller versions of the data, use the same method as here: https://github.com/dask/dask-tutorial

well, here: https://github.com/dask/dask-tutorial/blob/master/prep.py