genetics-of-dna-methylation-consortium / godmc_phase2

This repository contains the code to run the analysis pipeline for phase 2 of goDMC released June 2024.
GNU General Public License v3.0
2 stars 0 forks source link

[Bug]: 03g issue with cg07959070 #13

Open drsora opened 1 month ago

drsora commented 1 month ago

Contact Details

sonja.rajic@tuni.fi

Scripts

03g-perform_positive_control.sh

What happened?

03g doesn't run fully, output is below I can't upload 03g because some files are missing, I uploaded a-d

How can the bug be reproduced?

No response

R version

4.4.0 (April, 2024)

Python version

None

Relevant log output

make manhattan and qq plots
Reading in /data/sonja/godmc/results/03/positive_control_untransformed_cg07959070.PHENO1.glm.linear.gz GWAS results

Expecting a large meQTL near 22:50053871
Lowest p-value within 1e+05 base pairs: 0.00127855
WARNING!
There doesn't appear to be a QTL for this positive control
Please upload this section and contact GoDMC analysts before continuing.

Generating QQ-plot without cis chromosome for/data/sonja/godmc/results/03/positive_control_untransformed_cg07959070 with lambda 0.943745114347533
Generating manhantten plot without cis chromosome /data/sonja/godmc/results/03/positive_control_untransformed_cg07959070
Error in manhattan(man_data, bp = names(man_data)[pos_column], chr = names(man_data)[chr_column],  : 
  #CHROM column should be numeric. Do you have 'X', 'Y', 'MT', etc? If so change to numbers and try again.
Calls: main -> manhattan
Execution halted
drsora commented 1 month ago

I ran this in my other cohort and get a different error message: make manhattan and qq plots Reading in /data/sonja/godmc/luric/results/03/positive_control_untransformed_cg07959070.PHENO1.glm.linear.gz GWAS results Error in main() : Wrong column specified for p-values Execution halted

SiyiSEA commented 1 month ago

Hi @drsora ,

I’m just back from holiday, so apologies for the slight delay.

Regarding the first error, it seems that the lowest p-value(0.00127855) within the control window is greater than the positive_control_threshold (0.001). The positive_control_threshold can be set in the parameter file depend on the sample size of your cohort. In addition, the Chromosome column may contain characters other than numbers from 1 to 23.

The second error suggests that the P-value is outside the expected range of (0,1).

Would you mind checking the columns of the /data/sonja/godmc/luric/results/03/positive_control_untransformed_cg07959070.PHENO1.glm.linear.gz file from your two corhorts?

Is column 12 the P-value column? If the P-value is out of the range (0, 1), this could be causing the error. Is column 1 the Chromosome column, and does it only contain values from 1-23? Is column 2 the Position column? Is column 3 the SNP column?

Thanks