getian107 / PRScsx

Cross-population polygenic prediction
MIT License
65 stars 20 forks source link

A follow-up tutorial on calculating polygenic score #19

Closed fys666 closed 2 years ago

fys666 commented 2 years ago

Dear Tian,

First,Thank you for your valuable tool for PRS analysis. It really helps me a lot.

I'm new to the genetics. May I ask the key follow-up parts, such as how the results built using your example files can be combined with plink? That is to say, how to use the results of PRScsx and plink to calculate the polygenic score?

Could you give a more detailed tutorial? I looked at plink's tutorial and found that they provide three extra files in addition to the results given by PRScsx.

Binary file containing sample identifier (ID) and genotype (.bed); Contains relevant individual information files (.fam); Genetic marker files (*.bim).

For example: plink \ --bfile EUR.QC \ --score result_EAS_pst_eff_a1_b0.5_phi1e-02_chr22.txt 2 4 6 \ --out EUR

They provide "--bfile". These are the three documents I mentioned earlier. Could you please provide them as well?

Sorry for my poor English.

Bests,

fys

getian107 commented 2 years ago

Hi- The three files (.bed, .bim and .fam) correspond to your target dataset (i.e., the dataset in which you would like to evaluate the performance of the polygenic score). Briefly PRS-CS(x) provides adjusted SNP weights using GWAS summary statistics. You can then take those weights to your target dataset to calculate individual-level polygenic scores. Let me know if you need more information. And if you are new to genetics and PRS analysis, this review might be helpful: https://www.nature.com/articles/s41596-020-0353-1

fys666 commented 2 years ago

Thanks for your quick reply and the review you recommend. Have a good day!