Open bkatiemills opened 9 years ago
@Blahah ha, it could fill one - but it would be nice to wrap up this course with something that sets people on a course of action with the new things they learned. Do you think there's a useful way to go about this in only one session?
For the section on 3. Open Data II: Clean Data
I recommend some 'convention over configuration' advice, and links to evidence-based, recommended filing systems. My faves are:
\ProjectAcronym
    \- History starting YYYY-MM-DD
    \- Hold then delete
    \Admin
    \Documentation
    \Posted
        \Paper 1
            \Correspondence
            \Text
            \Analysis
    \PrePosted
    \Resources
    \Write
    \Work
This concept was originally introduced by Josh Reich as the LCFD (load, clean, func, do) framework on Stack Overflow (http://stackoverflow.com/a/1434424), and is encoded in the makeProject R package (http://cran.r-project.org/web/packages/makeProject/makeProject.pdf).
# choose your project dir
setwd("~/projects")
library(makeProject)
makeProject("makeProjectDemo")
# gives
# /makeProjectDemo/
#     /code/*.R
#     /data/
#     /DESCRIPTION
#     /main.R
# in main.R you put
source("code/load.R")   # read in the raw data
source("code/clean.R")  # tidy it up
source("code/func.R")   # define the functions the analysis needs
source("code/do.R")     # run the analysis
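For anyone who wants to see what the LCFD convention amounts to without installing a package, here is a minimal base-R sketch that builds the same kind of skeleton. The helper name `scaffold_lcfd` and the demo directory are hypothetical, and this is not how makeProject itself is implemented (it omits the DESCRIPTION file, for instance); it only illustrates the layout described above.

```r
# Hypothetical helper: build an LCFD-style project skeleton with base R only.
# This is an illustration of the convention, not makeProject's own code.
scaffold_lcfd <- function(root) {
  steps <- c("load", "clean", "func", "do")

  # Create the project root plus its code/ and data/ subdirectories.
  dir.create(file.path(root, "code"), recursive = TRUE, showWarnings = FALSE)
  dir.create(file.path(root, "data"), showWarnings = FALSE)

  # One empty script per LCFD step; existing files are left untouched.
  scripts <- file.path("code", paste0(steps, ".R"))
  for (s in file.path(root, scripts)) {
    if (!file.exists(s)) file.create(s)
  }

  # main.R sources the four steps in order.
  main <- file.path(root, "main.R")
  if (!file.exists(main)) {
    writeLines(sprintf('source("%s")', scripts), main)
  }
  invisible(root)
}

# Usage: scaffold into a temporary directory so nothing real is overwritten.
demo_root <- file.path(tempdir(), "lcfdDemo")
scaffold_lcfd(demo_root)
list.files(demo_root, recursive = TRUE)
```

Writing `main.R` only when it does not already exist means the helper can be re-run on a project in progress without clobbering work.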
/project/
    /cache/
    /config/
    /data/
    /diagnostics/
    /doc/
    /graphs/
    /lib/
        /helpers.R
    /logs/
    /munge/
    /profiling/
        /01_profile.R
    /reports/
    /src/
        /01_EDA.R
        /02_clean.R
        /03_do.R
    /tests/
        /01_tests.R
    /README
    /TODO
Thanks, @ivanhanigan, this is great stuff! We talked about similar things at Study Group Journal Club at UBC the other week - we read this paper and this other paper which touch on related topics - definitely all things to include. Thanks again for the notes!
Open Science 101
This is a session series introducing practical skills needed to get started in open science. A Mozilla Science Study Group can use this series to introduce open science over an academic semester.
Help Us Develop This Curriculum
We're trying to answer these questions:
Let us know your thoughts in the comments!
Sessions