zhengxwen / HIBAG

R package – HLA Genotype Imputation with Attribute Bagging (development version only)
https://hibag.s3.amazonaws.com/index.html
29 stars 7 forks source link

The numbers of SNPs are not consistent #26

Open JingjingBai2021 opened 1 year ago

JingjingBai2021 commented 1 year ago

Hi , I am using Hibag for HLA imputation. My dataset is GSA genotyped data.

I found when I use the function "hlaBED2Geno" to import the genotype data, there are around 8000 SNPS in the MHC region. But when I use plink to extract MHC region from my genotype data, there are around 40,000 SNPs.

I am wondering where the discrepancy comes from. And for the HLA Manhattan plot, I am trying to merge the SNPs in the MHC region with imputed HLA alleles. Is there any function that I can use in HIBAG to accomplish that?

Thank you in advance!

zhengxwen commented 1 year ago
     hlaBED2Geno(bed.fn, fam.fn, bim.fn, rm.invalid.allele=FALSE,
         import.chr="xMHC", assembly="auto", verbose=TRUE)

Set import.chr="6", try it again.