Closed JoniColeman closed 5 months ago
Should we assume the data has been processed through ukb_conv r
and is a .tsv
and .r
file vs ukb_conv csv
, etc?
I know the .tsv
file output by ukb_conv r
uses integer coding, and the accompanying .r
script reads it in and applies factor labels, but I can't remember if the .csv
option outputs columns as integers or strings.
If the Data field properties and the various "Values for encoding…" tables were available with the new fields, they could be used as the basis for programmatically creating test data.
Need to create test data that follows the raw data described in the MHQ2_Raw_Data document. Data should be messy.