Friday July 8th, 9:30am

Meeting Agenda

I will update my MAP files before the meeting.

Go through files 0-7 and address any changes, errors, or clean-ups
Some help with files 5-7
Discuss how to merge the new variables into the MAP curator GitHub
Wait for the new data pull, then work on bringing the new data into MAP curator

andkov commented 8 years ago

NOTE:

You must filter out the observations from different studies (non-MAP) before you merge the wide and the long file. Otherwise you will produce duplicated rows that would need to be removed later.
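
A minimal sketch of that order of operations on toy data. The object and column names (`ds_wide`, `ds_long`, `id`, `study`) are assumptions for illustration, not the actual MAP curator variables, and the duplication mechanism shown (one person appearing under two studies in the wide file) is only one way the extra rows could arise:

```r
library(dplyr)

# Toy wide file: person 1 appears under two studies (hypothetical data)
ds_wide <- data.frame(
  id            = c(1, 1, 2),
  study         = c("MAP", "MARS", "MAP"),
  apoe_genotype = c(34, 34, 33)
)

# Toy long file: one row per person per wave
ds_long <- data.frame(
  id   = c(1, 1, 2, 2),
  wave = c(0, 1, 0, 1),
  mmse = c(29, 28, 30, 30)
)

# Merging first duplicates every long row of person 1
# (recent dplyr versions may also warn about a many-to-many match here)
merged_unfiltered <- dplyr::left_join(ds_long, ds_wide, by = "id")

# Filtering to MAP first keeps one wide row per person, so no duplicates
ds_wide_map <- ds_wide %>% dplyr::filter(study == "MAP")
merged_map  <- dplyr::left_join(ds_long, ds_wide_map, by = "id")

nrow(merged_unfiltered) # 6 rows: person 1 is doubled
nrow(merged_map)        # 4 rows: same as ds_long
```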

andkov commented 8 years ago

Amelia - a package to visualize missing patterns http://www.inside-r.org/packages/cran/Amelia/docs/missmap

#check missing data
#this can be intensive, save everything in case computer crashes 
Amelia::missmap(dsw, main = "Missing values vs observed")

andkov commented 8 years ago

tighten up the subsetting and creation of new variables:

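# ---- tweak-data ---------------------------------------
d <- ds %>% 
  dplyr::mutate(
    vital_status    =  ifelse(is.na(age_death) , 0 , 1), # jamie, this is it.
    # NOTE: same expression as vital_status; confirm the intended source variable
    dementia_status =  ifelse(is.na(age_death) , 0 , 1),
    stroke_status   =  stroke_cum,
    path_status     =  ad_reagan,
    # 1 = at least one e4 allele, 0 = otherwise (overwrites the raw genotype code)
    apoe_genotype   =  ifelse(apoe_genotype %in% c(44,34,24), 1, 0),
    # include.lowest = TRUE keeps a score of 0 in the first bin instead of NA
    group_smell     =  ordered(cut(total_smell_test, c(0,5,10,12), 
                                   labels=c("anosmic", "hyposmic", "normosmic"),
                                   include.lowest = TRUE))
  )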
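
A short sketch of how `cut()` treats the bin edges for these breaks. By default the intervals are right-closed, so with breaks `c(0, 5, 10, 12)` a score of exactly 0 falls outside `(0,5]` and becomes `NA` unless `include.lowest = TRUE` is set. The `scores` vector below is made-up test data:

```r
# Made-up scores sitting on and around the break points
scores <- c(0, 5, 6, 10, 11, 12)
smell_labels <- c("anosmic", "hyposmic", "normosmic")

# Default: intervals are (0,5], (5,10], (10,12], so 0 is not covered
default_cut <- cut(scores, breaks = c(0, 5, 10, 12), labels = smell_labels)

# include.lowest = TRUE closes the first interval on the left: [0,5]
closed_cut <- cut(scores, breaks = c(0, 5, 10, 12), labels = smell_labels,
                  include.lowest = TRUE, ordered_result = TRUE)

is.na(default_cut[1])  # TRUE: the score of 0 was dropped to NA
table(closed_cut)      # 2 anosmic, 2 hyposmic, 2 normosmic
```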

andkov commented 8 years ago

x: age; y: mmse; facets: smell grouping and apoe status; color: cumulative lifetime stroke status
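
A rough sketch of a plot with this layout in ggplot2, not the code that produced the original graph. The toy data and the column names `id`, `age`, and `mmse` are assumptions; `group_smell`, `apoe_genotype`, and `stroke_status` follow the names used in the tweak-data chunk:

```r
library(ggplot2)

# Toy longitudinal data: 4 people, 3 visits each (hypothetical values)
d_toy <- data.frame(
  id            = rep(1:4, each = 3),
  age           = rep(c(80, 82, 84), times = 4),
  mmse          = c(29, 28, 27, 30, 30, 29, 27, 25, 22, 28, 28, 27),
  group_smell   = rep(c("anosmic", "normosmic"), each = 6),
  apoe_genotype = rep(c(0, 1, 0, 1), each = 3),
  stroke_status = rep(c(0, 0, 1, 0), each = 3)
)

# One line per person; rows = smell group, columns = apoe status
g <- ggplot(d_toy, aes(x = age, y = mmse, group = id,
                       color = factor(stroke_status))) +
  geom_line() +
  facet_grid(group_smell ~ apoe_genotype) +
  labs(color = "stroke_status")

print(g)
```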

andkov commented 8 years ago

x: age; y: mmse; facets: smell group and pathology status (1 = most pathologies, 3 = least); color: apoe status

knighttime commented 8 years ago

These graphs are super! Is there any way to get rid of the NA column? Will pick this up next Friday.

andkov commented 8 years ago

Yes, just modify the dplyr::filter() call
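
For example, something along these lines would drop the rows that create an NA facet before plotting. This is a sketch on toy data; which variables actually need the `!is.na()` check depends on the facets in the graph:

```r
library(dplyr)

# Toy data with missing values in the faceting variables (hypothetical)
d_toy <- data.frame(
  id          = 1:4,
  group_smell = c("anosmic", NA, "normosmic", "hyposmic"),
  path_status = c(1, 2, NA, 3)
)

# Keep only rows where both faceting variables are observed
d_plot <- d_toy %>%
  dplyr::filter(!is.na(group_smell), !is.na(path_status))

d_plot$id # rows 1 and 4 remain
```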

ampiccinin commented 8 years ago

re: filtering non-MAP studies - could this also be resolved by retaining all data from all studies (which someone might like to do)?

knighttime commented 8 years ago

@andkov - got it!!! Thanks + I bought that graphing book 👍

knighttime commented 8 years ago

@ampiccinin We discovered that the duplicates were mostly in MARS, so subsetting down to just MAP should fix them. A bit more experimentation is needed to figure out why both Cassandra and Rebecca found duplicates in theirs. I thought that their data sets were already subset down to just MAP, so we need to check that.

IALSA / longitudinal-response-pattern

2016-07-08 #5
