RMHogervorst / summarize_dat

Provide comprehensive documentation for your dataset in a simple way
0 stars 1 forks source link

find functions in codebase that create new variables. #11

Open RMHogervorst opened 8 years ago

RMHogervorst commented 8 years ago

recognize df$name <- in names(df) of ultimate dataset recognize variants such as df[,name] <- or =

make list of these variables.

find first creation of data frame. import statements from foreign readr, read.csv etc repeat and find names?

Other options are the R data provenance software Rdatatracker that unfortunately requires java

RMHogervorst commented 8 years ago

or a reccurent thing per r script first df you created find name with assignment operator f.i. data <- data2 then search for data2 <- data2 <- data1 then search for data1 <-

etc

document per script where things are created. added etc.

until top of document.