@jaroberti Thought we could try a more functional naming scheme for our scripts. Let me know what you think.
W1...R : script that wrangles data
M1...R : script that executes model based on output from "W#...R" files
E1....R: script for doing graphical or exploratory review of data from W1.R and M1.R
no_prefix...R : old script that does too many things, should divide into more task oriented scripts into the scripts2018 folder
S1...R : script that does too much stuff, should be "atomized" into a file that does one tasks e.g. data wrangling/processing, data modeling, or data exploration
NOTE: numbers imply sequence files should be run in within a category
e.g.
W1.R %>% W2.R %>% M1.R %>% E1.R
W1.R might calculate a rank and that's it
W2.R is a second wrangling file that might add additional team statistics, but it should be run AFTER W1.R
M1.R sources/imports data output from W1.R, runs a random forest where the model = win_loss ~ elo_rank
E1.R might be a file that runs ggplot on data from W1.R AND visualizes the random forest results from M1.R
@jaroberti Thought we could try a more functional naming scheme for our scripts. Let me know what you think.
W1...R
: script that wrangles dataM1...R
: script that executes model based on output from "W#...R" filesE1....R
: script for doing graphical or exploratory review of data from W1.R and M1.Rno_prefix...R
: old script that does too many things, should divide into more task oriented scripts into thescripts2018
folderS1...R
: script that does too much stuff, should be "atomized" into a file that does one tasks e.g. data wrangling/processing, data modeling, or data explorationNOTE: numbers imply sequence files should be run in within a category
e.g.
W1.R
%>%W2.R
%>%M1.R
%>%E1.R
win_loss ~ elo_rank
ggplot
on data from W1.R AND visualizes the random forest results from M1.R