choishingwan / PRSice

A software package for calculating, applying, evaluating and plotting the results of polygenic risk scores
http://prsice.info
GNU General Public License v3.0
187 stars 90 forks source link

[BUG] PRSice would calculate allele frequencies even if --maf is not there #284

Closed ptn24 closed 2 years ago

ptn24 commented 2 years ago

Describe the bug It seems like PRSice would calculate allele frequencies even if --maf is not there. Is that intended? If it is not necessary, then is there a way to turn it off?

Error Log

PRSice 2.3.5 (2021-09-20) 
https://github.com/choishingwan/PRSice
(C) 2016-2020 Shing Wan (Sam) Choi and Paul F. O'Reilly
GNU General Public License v3
If you use PRSice in any published work, please cite:
Choi SW, O'Reilly PF.
PRSice-2: Polygenic Risk Score Software for Biobank-Scale Data.
GigaScience 8, no. 7 (July 1, 2019)
2021-12-01 08:17:57
./PRSice_linux \
    --a1 A1 \
    --a2 A2 \
    --allow-inter  \
    --bar-levels 0.001,0.05,0.1,0.2,0.3,0.4,0.5,1 \
    --base ... \
    --binary-target T \
    --bp BP \
    --chr CHR \
    --clump-kb 250kb \
    --clump-p 1.000000 \
    --clump-r2 0.100000 \
    --extract ... \
    --interval 5e-05 \
    --keep ... \
    --lower 5e-08 \
    --num-auto 22 \
    --out PRSice \
    --pheno ... \
    --pheno-col trait_prsice \
    --pvalue P \
    --seed 1822096333 \
    --snp SNP \
    --stat BETA \
    --target-list target-list,ukb22828_c21_b0_v3.sample \
    --thread 16 \
    --type bgen \
    --upper 0.5

Initializing Genotype info from file: target-list (bgen) 
With external fam file: ukb22828_c21_b0_v3.sample 

Start processing 
... 
================================================== 

Only one column detected, will assume only SNP ID is 
provided 

Base file: 
... 
GZ file detected. Header of file is: 
CHR     BP      A1      A2      SNP     P       OR 

16543335 variant(s) observed in base file, with: 
7463751 variant(s) excluded based on user input 
1267696 ambiguous variant(s) excluded 
7811888 total variant(s) included from base file 

Loading Genotype info from target 
================================================== 

487409 people (222969 male(s), 264266 female(s)) observed 
41291 founder(s) included 

1152901 variant(s) not found in previous data 
108257 variant(s) included 

Calculate MAF and perform filtering on target SNPs 
================================================== 

108257 variant(s) included
...

To Reproduce If possible, provide a minimum working example for me to reproduce the problem. This can usually speed up the debugging process.

Additional context Add any other context about the problem here.

stale[bot] commented 2 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.