zhengxwen / HIBAG

R package – HLA Genotype Imputation with Attribute Bagging (development version only)
https://hibag.s3.amazonaws.com/index.html

More than 50% of SNPs are missing #22

Open seagullOnABrick opened 1 year ago

seagullOnABrick commented 1 year ago

Hi, I am currently working on HLA imputation for a Norwegian cohort using SNP data. Every time I run my HIBAG R script, whether I use the pre-fit models or build a model and predict in parallel, I get the warning “More than 50% of SNPs are missing!”. I have checked my input PLINK files, and they contain most of the SNP IDs (252/275) from the HapMap_CEU_Geno$snp.id list. Could you tell me what triggers this warning in your R code, so I can correct how I use the software? Thank you.

JingjingBai2021 commented 1 year ago

I guess it depends on your genotyping platform. If you used the ImmunoChip, the model should also be derived from ImmunoChip data; otherwise you will get a low mapping rate.

zhengxwen commented 1 year ago

Please use platform-specific models.
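
For anyone hitting the same warning, here is a minimal sketch of how the "mapping rate" mentioned above could be checked outside of R. It is not HIBAG code. It assumes you have exported the SNP list of the model you actually predict with (not an example dataset) as (chromosome, SNP ID, position) tuples, which is a hypothetical export format, and it reads the cohort SNPs from a standard PLINK .bim file. The function names `read_bim` and `mapping_rate` are made up for illustration. The overlap is computed both by SNP ID and by position, on the assumption (not confirmed in this thread) that the two can differ, for example when IDs agree but the coordinates come from a different genome build.

```python
import io


def read_bim(handle):
    """Parse a PLINK .bim file into (chromosome, snp_id, position) tuples.

    A .bim line has six whitespace-separated fields:
    chromosome, variant ID, genetic distance, base-pair position, allele 1, allele 2.
    """
    snps = []
    for line in handle:
        fields = line.split()
        if len(fields) < 6:
            continue  # skip blank or malformed lines
        chrom, snp_id, _cm, pos = fields[:4]
        snps.append((chrom, snp_id, int(pos)))
    return snps


def mapping_rate(model_snps, cohort_snps):
    """Return the fraction of model SNPs found in the cohort, by ID and by position."""
    if not model_snps:
        raise ValueError("model SNP list is empty")
    cohort_ids = {snp_id for _chrom, snp_id, _pos in cohort_snps}
    cohort_positions = {(chrom, pos) for chrom, _snp_id, pos in cohort_snps}
    n = len(model_snps)
    by_id = sum(1 for _c, snp_id, _p in model_snps if snp_id in cohort_ids) / n
    by_pos = sum(1 for c, _i, p in model_snps if (c, p) in cohort_positions) / n
    return by_id, by_pos


# Toy data only: a four-SNP "model" and a cohort .bim whose IDs mostly match
# but whose positions mostly do not.
model = [("6", "rs1", 100), ("6", "rs2", 200), ("6", "rs3", 300), ("6", "rs4", 400)]
cohort_bim = io.StringIO(
    "6 rs1 0 100 A G\n"
    "6 rs2 0 210 C T\n"
    "6 rs3 0 310 A C\n"
    "6 rs9 0 500 G T\n"
)
by_id, by_pos = mapping_rate(model, read_bim(cohort_bim))
print(f"matched by ID: {by_id:.0%}, matched by position: {by_pos:.0%}")
# prints "matched by ID: 75%, matched by position: 25%"
```

If the rate against the model's own SNP list is below one half on the key that is used for matching, a model built for a different genotyping platform is a likely cause, which is what the replies above point to.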