countering-bean-counting / bonnyci_ci-plunder

CI usage data plundering

Serialize github data (rda/rds) #46

Closed by missaugustina 7 years ago

missaugustina commented 7 years ago

See if there's an advantage to serializing the GitHub data as rda/rds versus just loading the CSV files. These are big files, so we should go with whichever format loads fastest.
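
One way to check this is to time both load paths on the same data. The sketch below is only an illustration: the `builds` data frame and its columns are made-up stand-ins for the real CSV exports, and timings will vary by machine and file size.

```r
# Hypothetical stand-in for one of the large CSV exports (not the real data).
set.seed(1)
n <- 200000
builds <- data.frame(
  id = seq_len(n),
  repo = sample(c("a/b", "c/d", "e/f"), n, replace = TRUE),
  duration = runif(n),
  stringsAsFactors = FALSE
)

csv_path <- tempfile(fileext = ".csv")
rds_path <- tempfile(fileext = ".rds")

# Write the same data in both formats.
write.csv(builds, csv_path, row.names = FALSE)
saveRDS(builds, rds_path)

# Time each load path.
csv_time <- system.time(from_csv <- read.csv(csv_path, stringsAsFactors = FALSE))
rds_time <- system.time(from_rds <- readRDS(rds_path))

# Compare elapsed load time (seconds) and file size on disk (bytes).
print(c(csv = csv_time[["elapsed"]], rds = rds_time[["elapsed"]]))
print(c(csv = file.size(csv_path), rds = file.size(rds_path)))

# readRDS restores the object exactly, including column types,
# whereas read.csv has to re-infer types from text.
stopifnot(identical(builds, from_rds))
```

Running this against a real export instead of the synthetic frame would show whether the difference is large enough to matter here.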

missaugustina commented 7 years ago

I decided to keep the larger data sets in Google BigQuery and to serialize only summary tables and samples.
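
As a rough sketch of what serializing only summary tables and samples could look like: the data frame below is a synthetic placeholder for a query result (no BigQuery calls are shown), and the names `build_summary` and `build_sample` are hypothetical. It bundles both objects into a single .rda file with `save()`, which is the main difference from the one-object-per-file .rds format.

```r
# Synthetic placeholder for data that would otherwise be pulled from BigQuery.
set.seed(42)
builds <- data.frame(
  repo = sample(c("a/b", "c/d", "e/f"), 5000, replace = TRUE),
  duration = runif(5000, min = 10, max = 600),
  stringsAsFactors = FALSE
)

# A small summary table: mean build duration per repo.
build_summary <- aggregate(duration ~ repo, data = builds, FUN = mean)

# A small random sample of rows for local exploration.
build_sample <- builds[sample(nrow(builds), 100), ]

# Bundle both objects into one .rda file.
rda_path <- tempfile(fileext = ".rda")
save(build_summary, build_sample, file = rda_path)

# load() restores objects under their original names; loading into a
# separate environment avoids overwriting anything in the workspace.
restored <- new.env()
loaded_names <- load(rda_path, envir = restored)
print(loaded_names)
```

Use `saveRDS()` instead when each table should live in its own file and be assigned to a name of the caller's choosing on load.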