ColemanResearchGroup / MHQ2

Code for deriving phenotypes from the second Mental Health Questionnaire in UK Biobank
MIT License
1 stars 1 forks source link

Test Data #12

Closed JoniColeman closed 5 months ago

JoniColeman commented 1 year ago

Need to create test data that follows the raw data described in the MHQ2_Raw_Data document. Data should be messy.

mja commented 1 year ago

Should we assume the data has been processed through ukb_conv r and is a .tsv and .r file vs ukb_conv csv, etc?

I know the .tsv file output by ukb_conv r uses integer coding, and the accompanying .r script reads it in and applies factor labels, but I can't remember if the .csv option outputs columns as integers or strings.

mja commented 1 year ago

If the Data field properties and the various "Values for encoding…" tables were available with the new fields, they could be used as the basis for programmatically creating test data.