data-edu / dataedu

An R package associated with the Data Science in Education Using R book
https://data-edu.github.io/dataedu/
Creative Commons Attribution 4.0 International
74 stars 12 forks source link

add synthetic data and add raw and processed data folders and the data-raw folder #10

Closed jrosen48 closed 4 years ago

jrosen48 commented 4 years ago

@restrellado I made a raw and processed folder in data - would it make sense to add the CSV for tt_tweets into the processed (or the raw) data folder? Perhaps raw could be only the status_ids that we passed on to rtweet (if you think/agree this is helpful, I can add this).

jrosen48 commented 4 years ago

going to stop here with this one instead of adding other data here (will open another) - this was to add @emilybovee's sci_mo_data in exactly the same way she did, except (per this issue) the data is synthesized via synthpop. The purpose of the package is to make it so the relationships in the data are the same, though the specific values for an individual case should be different (synthetic). My big (only) concern is that any values that we interpret may be slightly different (and, I hope not very different). @emilybovee perhaps we can look together to make sure it's right.

Also, and part of why I want to not let this PR become ginormo...

I also used code in data-raw to add the data via usethis::use_data() - literally two lines (and loading the package:

library(readr)
library(here)

sci_mo_data <- read_csv(here("data", "processed", "sci_mo_data.csv"))
usethis::use_data(sci_mo_data, overwrite = TRUE)
jrosen48 commented 4 years ago

added the raw data and the code to process it - going to merge this