vanderbilt-data-science / ancient-artifacts

Dynamic image analysis to identify ancient artifacts in soil samples. We will work on microdebitage (the debris of ancient stone knapping first) and later expand to other materials (e.g., mortars).
MIT License
6 stars 2 forks source link

Generate template code to load data using vroom #8

Closed csbell-vu closed 3 years ago

csbell-vu commented 3 years ago

Although we have reasonable-sized data currently committed to the repo for analysis, we expect that very soon, we will have data that has hundreds of thousands to millions of rows. For this reason, we'll be exploring different methods of loading data and generating template code for one of these methods.

Issue #7 details the structure for the loaded data. Another approach for reading the data can be through using the R package vroom. Vroom allows the usage of multiple threads and lazy loading to accelerate data reading and writing. Documentation and example code can be found here. Note the requirements of the data structure format created in Issue #7, and vroom contains several arguments which can facilitate this desired structure.

The work should occur in a notebook prefixed with the number 12, e.g., 12-vroom-load.Rmd or similar.