DataBiosphere / analysis_pipeline_WDL

Collection of WDL workflows based off the University of Washington TOPMed DCC Best Practices for GWAS. The WDL structure was based upon CWLs written by the Seven Bridges development team.

Mismatch in some, not all, assoc-agg outputs #59

Closed by aofarrel 2 years ago

aofarrel commented 2 years ago

weights:

position:

allele:

aofarrel commented 2 years ago

Here are allele's chr1's position files in RStudio, Terra vs SB. Something fishy is going on.

Edit (Jan 2021): I think "allele's chr1's position files" was supposed to be "chr1's position files", but I don't know whether it referred to assoc-agg the task or assoc-agg the workflow.

[Two screenshots of the files in RStudio, taken 2021-10-26 at 3:11 PM]

However, the resulting plots are essentially equivalent across SB and Terra. So although one of these files has an extra column, that doesn't seem to affect the plotting step.

Terra QQ: [image: terra_allele_qq]

SB QQ: [image: allele_qq]

Terra Manhattan: [image: terra_allele_manh]

SB Manhattan: [image: allele_manh]

aofarrel commented 2 years ago

If we look at weights instead...

assoccwl: weights_rerun_sbg_chr1_seg1.RData as a group_segments task output
assocwdl: 1KG_phase3_subset_annot.RData as an assoc_aggregate task input

all.equal(assoccwl, assocwdl, tolerance=1e-100)
[1] "Component “results”: Component “Score”: Mean relative difference: 1.48677959e-16"
[2] "Component “results”: Component “Score.Stat”: Mean relative difference: 1.32878924e-16"
[3] "Component “results”: Component “Score.pval”: Mean relative difference: 6.739930668e-15"
[4] "Component “results”: Component “Est”: Mean relative difference: 2.375174983e-16"
[5] "Component “results”: Component “PVE”: Mean relative difference: 3.367141493e-16"
options(digits=22)
print(assocwdl[["results"]]["Score"])
Score
375790 11.94768109113651455289
print(assoccwl[["results"]]["Score"])
Score
375790 11.94768109113651632924
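
For scale: the two printed Score values differ by roughly 1.8e-15, which is about one unit in the last place for a double near 12, and the relative differences reported above are all within a few tens of machine epsilon (about 2.2e-16). The sketch below shows why `tolerance=1e-100` flags a difference this small while the default tolerance would not. The values `a` and `b` are stand-ins I made up to mimic the printed Score (they are not read from the RData files), so treat this as an illustration rather than a diagnosis.

```r
# Stand-in values (assumption, not the real data): `a` approximates the
# printed Score, and `b` is the next representable double above it.
# Doubles in [8, 16) are spaced 2^-49 apart, so this addition is exact.
a <- 11.947681091136515
b <- a + 2^-49

identical(a, b)                    # FALSE: the two doubles are not the same
rel_diff <- abs(a - b) / abs(a)    # relative difference, roughly 1.5e-16
rel_diff < .Machine$double.eps     # TRUE: smaller than machine epsilon

# The default tolerance (about 1.5e-8) treats the two values as equal ...
isTRUE(all.equal(a, b))            # TRUE
# ... while an extremely small tolerance reports the difference as a string.
all.equal(a, b, tolerance = 1e-100)
```

If that reading is right, the weights mismatch would be at the level of floating-point rounding (for example, a different order of operations or a different math library on each platform) rather than a difference in the analysis itself, though that is not confirmed here.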

aofarrel commented 2 years ago

This is still an issue, but given the big changes that happened upstream, it is worth opening a new ticket for it.