SMaRTWorkshops / mlr

Materials for the "Machine Learning in R" Workshop
https://smartworkshops.github.io/mlr/

Dataset Ideas #7

Closed ShirleyBWang closed 3 years ago

ShirleyBWang commented 3 years ago

I started looking for some open datasets we could use for live coding, group activities, etc. Eiko has a bunch on his website that look promising (https://eiko-fried.com/data/) -- these two stood out to me:

Another option is to simulate our own data and label the variables so they look psychological! What do you think?
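To make the simulation idea concrete, here is a minimal sketch of what it could look like. It is written in Python purely for illustration (the same approach carries over to R), and everything in it is an assumption made up for the example: the variable names (`stress`, `sleep_hours`, `depression`), the coefficients, and the function `simulate_psych_data` are hypothetical and do not come from any real dataset.

```python
import random


def simulate_psych_data(n=200, seed=42):
    """Simulate n rows of made-up data with psychological-sounding labels.

    All variable names and effect sizes are hypothetical.
    """
    rng = random.Random(seed)  # local generator so results are reproducible
    rows = []
    for i in range(n):
        stress = rng.gauss(0, 1)         # standardized predictor
        sleep_hours = rng.gauss(7, 1.2)  # predictor centered near 7 hours
        # Outcome: linear combination of the predictors plus Gaussian noise
        depression = 10 + 3.0 * stress - 1.5 * (sleep_hours - 7) + rng.gauss(0, 2)
        rows.append(
            {
                "id": i + 1,
                "stress": stress,
                "sleep_hours": sleep_hours,
                "depression": depression,
            }
        )
    return rows


data = simulate_psych_data()
print(len(data), sorted(data[0].keys()))
```

Because the data-generating process is known, a simulation like this makes it easy to check whether a model recovers the effects that were built in.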

jmgirard commented 3 years ago

For explanatory examples, simulated data is great. But for activities, I think adding real data would be more interesting when possible.

Here are a few additional ideas:

https://www.kaggle.com/rashikrahmanpritom/heart-attack-analysis-prediction-dataset
https://www.kaggle.com/kwadwoofosu/predict-test-scores-of-students
https://www.kaggle.com/mirichoi0218/insurance
https://www.kaggle.com/ddmasterdon/income-adult
https://www.kaggle.com/sjleshrac/airlines-customer-satisfaction
https://www.kaggle.com/blurredmachine/are-your-employees-burning-out