jhudsl / intro_to_r

A 2-week introduction to R programming course, with a focus on public health datasets
https://jhudatascience.org/intro_to_r/
MIT License

Fewer datasets overall? #487

Open carriewright11 opened 6 months ago

carriewright11 commented 6 months ago

Moving Ava's comment here:

I know we've gotten feedback before that there are a lot of datasets, and that people wish there were a few they could get really familiar with. Is there any way we could cut down on them? I think this would probably take some work, though.

carriewright11 commented 6 months ago

Perhaps we can make a chart of how many datasets we used per lecture and what they are as a starting point?
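
One possible way to get a first draft of such a chart is to scan each lecture's source for data-loading calls and tabulate what turns up. The sketch below is in Python and rests on assumptions: the function names `datasets_per_lecture` and `as_markdown_table` are hypothetical, and the two patterns (`read_csv("...")` and `data(...)`) are only guesses at how the lectures load data, so anything loaded another way would be missed and would still need to be filled in by hand.

```python
import re
from collections import defaultdict

# Hypothetical patterns for calls that load a dataset in an R Markdown lecture.
# These are assumptions; the real lectures may load data in other ways.
LOAD_PATTERNS = [
    re.compile(r'\bread_csv\(\s*["\']([^"\']+)["\']'),  # read_csv("path or URL")
    re.compile(r'\bdata\(\s*([A-Za-z_.][\w.]*)\s*\)'),    # data(mtcars)
]


def datasets_per_lecture(lectures):
    """Map each lecture name to the sorted list of datasets its source loads.

    `lectures` is a dict of {lecture name: R Markdown source text}.
    """
    found = defaultdict(set)
    for name, source in lectures.items():
        found[name]  # make sure lectures with no matches still get a row
        for pattern in LOAD_PATTERNS:
            found[name].update(pattern.findall(source))
    return {name: sorted(names) for name, names in found.items()}


def as_markdown_table(summary):
    """Render the summary as a Markdown table: topic, count, dataset names."""
    rows = ["| Topic | # of Dataset(s) | Dataset(s) |", "| --- | --- | --- |"]
    for name, names in summary.items():
        rows.append(f"| {name} | {len(names)} | {', '.join(names)} |")
    return "\n".join(rows)
```

The source text for each lecture would need to be read from the repository first; the counts are only a starting point, since fake or inline example data would not be detected by these patterns.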

carriewright11 commented 6 months ago

| Topic | # of Dataset(s) | Dataset(s) | Notes |
| --- | --- | --- | --- |
| Basic R | | | |
| Basic R Lab | | | |
| Data Input | | | |
| Data Input Lab | | | |
| Subsetting | 3 (only a few slides for specific lessons are not the annual dosage dataset) | annual dosage opioid dataset from OCSdata and mtcars as a classic dataframe example with rows, one fake example of a dataset with spaces in column names for one slide | added the data to the website so people can practice import |
| Subsetting Lab | 1 | county pop from opioid dataset from OCSdata | added data to the website to simplify getting the data and encourage data import skills |
| Summarization | | | |
| Summarization Lab | | | |
| Classes | 1 (and vectors for simple illustration) | circ from jhur | useful to use vectors for class conversion (rare to have every kind of data in one dataset) |
| Classes Lab | 2 | fake vectors and circ from jhur | |
| Cleaning | 2 | bike from jhur and fake one for recoding | hard to find example for how much I wanted to recode |
| Cleaning Lab | 2 | bike from jhur and fake one for recoding | similar example to lecture but slightly different |
| Manipulating | | | |
| Manipulating Lab | | | |
| Intro to Data Viz | | | |
| Intro to Data Viz Lab | | | |
| Data Viz | | | |
| Data Viz Lab | | | |
| Factors | | | |
| Factors Lab | | | |
| Data Output | | | |
| Data Output Lab | | | |
| Functions | | | |
| Functions Lab | | | |