acwooding / ReproAllTheThings

Showcase for the Easydata framework (and regular test case) created by reproducing the EmbedAllTheThings notebooks by jc-healy.
Other
1 stars 2 forks source link

Change source of the datasets #36

Open acwooding opened 3 years ago

acwooding commented 3 years ago

Use OpenML instead of Kaggle? Switch to 150k?

https://www.openml.org/d/42074

This would help with some of the downloading confusion with the .zip file unpacking.

acwooding commented 3 years ago

Try out

from sklearn.datasets import fetch_openml data_openml = fetch_openml('wine_reviews') data = pd.DataFrame(data_openml.data)