merlebehr / epiTree

0 stars 1 forks source link

example file #1

Open yupinglin0301 opened 1 year ago

yupinglin0301 commented 1 year ago

hi, do you have the example file for phenotype and mapping file? Thanks

merlebehr commented 1 year ago

Hi Linyu,

the phenotype file is a whitespace delimited txt file with two columns, where the first column corresponds to a subject ID and the second column to the respective binary phenotype of that subject, e.g., of the form

IND1 0 IND2 1 IND3 1 …

the mapping file is a csv file with two columns, where the first column corresponds to the subject IDs from the phenotype file and the second column corresponds to respective subject ID in the genotype file (bim/fam/bam format), e.g., of the form

IND1, GENO-IND1 IND2, GENO-IND2 IND3, GENO-IND3 …

If the genotype file uses the same subject ID’s as the phenotype file, then this file would be of the form

IND1, IND1 IND2, IND2 IND3, IND3 …

Unfortunately, we cannot share the original UKBB individual level data files with you, but you would need to apply for your own UKBB access in order to receive those files.

I hope this helps.

Best, Merle