This tool is really nice. However, I have difficulty to follow the scripts to run it.
First, my phenotype file is like below, from running ukbconv.
Can you please provide a script that could convert such a data into the format as below?
ID R198 R104 M512 L720 M8414 L031 K802 S662 K267
1 1 1 1 0 0 0 0 0 0
2 0 0 0 1 0 0 0 0 0_
Since there are ~20,000 ICD codes and 500,000 individuals, it might be too big to put all the codes in a single file. So, it would be necessary to have an option to specify a "phenotype inclusion file". For example, I might want to analyze the sub-tree for circulatory diseases, i.e., all ICD-codes starting with letter "I".
Hi, there:
This tool is really nice. However, I have difficulty to follow the scripts to run it.
First, my phenotype file is like below, from running ukbconv.![ICD](https://user-images.githubusercontent.com/26947455/119733637-65698a80-beac-11eb-9c9e-351c9611a568.PNG)
Can you please provide a script that could convert such a data into the format as below? ID R198 R104 M512 L720 M8414 L031 K802 S662 K267 1 1 1 1 0 0 0 0 0 0 2 0 0 0 1 0 0 0 0 0_
Since there are ~20,000 ICD codes and 500,000 individuals, it might be too big to put all the codes in a single file. So, it would be necessary to have an option to specify a "phenotype inclusion file". For example, I might want to analyze the sub-tree for circulatory diseases, i.e., all ICD-codes starting with letter "I".
Please kindly let me know if that is doable.
Best regards, Jie