IALSA / ialsa-2016-groningen

Maelstrom Harmonization Workshop. Assessing the impact of different harmonization procedures on the analysis results from several real datasets.
GNU General Public License v2.0

pilot (A) for analytic workflow #5

Open andkov opened 8 years ago

andkov commented 8 years ago

@smhofer proposed the following plan for the reproducible report(s):

andkov commented 8 years ago

@smhofer, here is my commentary on your five sections. I need to introduce a slight modification to account for the way the scripts actually deal with the data. Specifically, I suggest implementing the processes in Sections 2 and 3 for each set of harmonized variables separately. It's more practical to organize it this way, and it will not change the end result of Section 3: creation of a combined data set.

The script ./manipulation/0-ellis-island.r produces a working report, ./manipulation/stitched-output/0-ellis-island.md. This report accomplishes Sections (1), (2b), and (3a). I've copiously annotated it, and it's meant to be a part of the live documentation. This is where one will go to find out how specifically the processes in Sections (1), (2a), and (3a) have been implemented.

Note that Section (2a) is accomplished outside of R by editing the file ./data/shared/meta-data-map.csv. I don't think it's a good idea for projects like these to rename variables by hand in a script. This is my biggest lesson learned from Portland, so I'd like to gently insist on this.
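To illustrate the idea of driving the renaming from a map rather than from hand-written code, here is a minimal sketch in base R. The column names (`study_name`, `name_original`, `name_new`), the study and variable names, and the helper `rename_from_map()` are all hypothetical, made up for this example; the actual layout of meta-data-map.csv and the logic in 0-ellis-island.r may well differ.

```r
# Hypothetical stand-in for ./data/shared/meta-data-map.csv.
# The real file may use different columns and contain more fields.
map_csv <- "study_name,name_original,name_new
study_a,SMOKE,smoke_now
study_a,AGE_YRS,age
study_b,smk,smoke_now
study_b,age,age"
meta_map <- read.csv(text = map_csv, stringsAsFactors = FALSE)

# Rename the columns of one study's data frame using the rows of the map
# that belong to that study. Columns not listed in the map are left as-is.
rename_from_map <- function(d, study, map) {
  rows <- map[map$study_name == study, ]
  idx  <- match(names(d), rows$name_original)   # NA where no mapping exists
  found <- !is.na(idx)
  names(d)[found] <- rows$name_new[idx[found]]
  d
}

# Toy data for one (made-up) study.
study_a <- data.frame(SMOKE = c(1, 0), AGE_YRS = c(70, 82))
study_a <- rename_from_map(study_a, "study_a", meta_map)
names(study_a)
```

With a setup like this, a change of variable name is made once in the CSV, and the script picks it up on the next run without any edits to the R code.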

I'm moving on to developing the scripts to implement Section (3c) for smoking.