amueller / scipy-2016-sklearn

Scikit-learn tutorial at SciPy2016
Creative Commons Zero v1.0 Universal
515 stars 516 forks source link

Adding IMDb to dataset to fetch script #31

Closed rasbt closed 8 years ago

rasbt commented 8 years ago

The sklearn.dataset.load_files function works like a charm. Loads the 50,000 files in ~5 sec (on my laptops, SSD). Feel free to merge now, the use case (out of core learning) will follow in a separate pull request ...

amueller commented 8 years ago

btw, we are currently downloading but not using the sentiment 140 dataset, right?

rasbt commented 8 years ago

That's right, I think. I can take it out now.