This repository is no longer being updated as many of the functions will be moved to https://github.com/LivingNorway/LivingNorwayR
Repro containing suggestive archive structure and procedures for data storage
The data package should work both for legacy data and contemporary data. With "legacy data" we understand data that not longer are actively managed. While the principles for data management and documentation is the same, the workflow may be slightly different. Due to the sheer volume of legacy data floating around in different institutions, there is a need for low-threshold archiving with the purpose of preserving information at a level where it may be retrievable, but not investing excessively preparation for use. Hence, priority here should be given to archiving raw data (either digitized or scanned analog material) with a minimum set of metadata to facilitate discovery and re-use.
Data-repository structure is simple and contains dedicated folders for documentation, data, source code along with metadata:
├── minimum_metadata.txt (or .md)
├── docs/ -dokumentation (e.g. procedual reports, laboratory protocols)
├── data/
| ├── scan_data/ - analog data in digital format
| ├── raw_data/ - raw data - born digital or punched from paper forms
| ├── mapped_data/ - data mapped from raw data
├── src/ - scripts et al. used for mapping data
├── meta.xml - metadata in EML
├── dmp.xxx - data managment plan
A typical project, either involving data rescue of legacy-data or contemporary data would involve several logical steps. Main parts outlined below:
Also, the general guidlines from BES could be useful.
TheDataPackage is still in development but you can download the latest version using the following code:
devtools::install_github("LivingNorway/TheDataPackage")
There is a short (very short at the moment) example workflow that will lead you through some of the steps in developing a data package. We recommend opening a new Project in RStudio and then running the following code in it. Follow the instructions shown in your console:
demo("workflow_example",package="TheDataPackage")