Closed ShirleyBWang closed 3 years ago
For explanatory examples, simulated data is great. But for activities, I think adding real data would be more interesting when possible.
Here are a few additional ideas: https://www.kaggle.com/rashikrahmanpritom/heart-attack-analysis-prediction-dataset https://www.kaggle.com/kwadwoofosu/predict-test-scores-of-students https://www.kaggle.com/mirichoi0218/insurance https://www.kaggle.com/ddmasterdon/income-adult https://www.kaggle.com/sjleshrac/airlines-customer-satisfaction https://www.kaggle.com/blurredmachine/are-your-employees-burning-out
I started looking for some open datasets we could use for live coding, group activities, etc. Eiko has a bunch on his website that look promising (https://eiko-fried.com/data/) -- these two stood out to me:
Another option is to just simulate our own data and put labels on them to make them seem psychological! What do you think?