ToolsVanBox / PTATO

PTa Analysis TOolkit
MIT License
9 stars 3 forks source link

SMuRF Performance issue #11

Closed um-pdavila closed 1 year ago

um-pdavila commented 1 year ago

Good afternoon,

The SMuRF section of our PTATO job has been running since 2023-04-26. We want to confirm this is not because of a bug. That is, that SMuRF's performance is the result of our dataset and parallelization limitations of SMuRF, as used in PTATO. We have spoken with several collaborators who are experiencing similar performance issues using PTATO (because of SMuRF).

Our dataset has 170 single cell, with coverage of 13x. This is much larger than the demo dataset, which does complete.

The SLURM job is running on 1 node, for two weeks, using 23 cores (64 allocated).

[fxd317@n533 ~]$ uptime load average: 22.35, 22.21, 22.19

[root@n521 ~]# du -hs $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp 59G /sccc/scratch/fxd317/DATA/bioskryb_3-6-2023/PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp

[root@n521 ~]# ls -lhtc $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/* | wc -l 3366

$ ls -lhtc /sccc/scratch/fxd317/DATA/bioskryb_3-6-2023/PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/* | head -n 25 -rw-r--r-- 1 fxd317 ccsuser 2.3G May 8 13:53 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/4.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.5G May 8 13:51 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/6.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 3.3G May 8 13:49 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/16.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.2G May 8 13:49 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/X.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 3.4G May 8 13:49 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/19.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.5G May 8 13:48 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/18.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.9G May 8 13:45 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/13.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.4G May 8 13:44 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/9.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.3G May 8 13:43 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/11.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 3.2G May 8 13:42 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/20.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.2G May 8 13:40 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/3.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.6G May 8 13:39 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/12.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.5G May 8 13:36 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/8.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 3.2G May 8 13:33 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/17.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.8G May 8 13:32 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/1.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.2G May 8 13:27 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/14.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.5G May 8 13:26 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/15.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.3G May 8 13:26 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/5.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.2G May 8 13:24 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/2.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 3.3G May 8 13:22 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/10.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.2G May 8 13:09 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/7.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 2.6G May 7 07:41 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/21.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 1.9G Apr 30 22:38 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/22.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 210K Apr 26 00:05 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/Un_KI270394v1.SMuRF.vcf -rw-r--r-- 1 fxd317 ccsuser 210K Apr 26 00:05 $PTATO/work/d6/3892059004580b0cfbf223d5421269/SMuRF_tmp/Un_KI270392v1.SMuRF.vcf

... 3340 additional files created and last accessed on “Apr 26 00:05”.

Best, Pedro

mroosmalen commented 1 year ago

I know SMuRF is not optimized for a lot of samples. We have tested it only on 5-10 samples , and it works the fastest if you give it also a control. What SMuRF does is it checks every mutation for each sample the raw bam file, to check if its finds any evidence for this mutation in the specified sample. To make it faster it will make some assumes, it will skip for example every mutation with a FILTER tag in the vcf file (keeps only the PASS variants) but also mutations that are found in the control (e.g. germline mutations) will be skipped.

The To give you some examples from our side:

17 hours for 8 cord blood samples and 1 control (15x) with 8 threads 8 hours for a AML patient with 4 samples and 1 control (15x) with 8 threads

I never tested it, but you could give PTATO also your own somatic vcf files per sample and skip SMuRF. The you should mimic this kind of folder:

/path/to/somatic_vcfs/DONOR_ID
       ./sample1.vcf.gz
       ./sample2.vcf.gz
       ....

and then point to this folder in your run config file

 short_variants {
      somatic_vcfs_dir = '/path/to/somatic_vcfs' # without the DONOR_ID
}

I hope this give will help you

um-pdavila commented 1 year ago

Thank you so much Dr. Roosmalen for the explanation and code. Following your suggestion we generated the somatic vcf files by running Smurf independently. Now we would like to run PTATO by providing these somatic vcf files in order to skip the SMURF steps. Please, could you provide us with detailed instructions to do that (including the setting in the config file and the input/output directory structure)?

We tried the following:

    optional {
      //germline_vcfs_dir = '/path_to/ptato/1.2.0/output/intermediate/germline'
      germline_vcfs_dir = ''

       short_variants {
         somatic_vcfs_dir = '/path_to/ptato/1.2.0/lab/jobs/donor1/input/somatic_vcfs'
         walker_vcfs_dir = ''
         //phased_vcfs_dir = '/path_to/output/intermediate/short_variants/shapeit'
         phased_vcfs_dir = ''
         ab_tables_dir = ''
         context_beds_dir = ''
         features_beds_dir = ''
       }
  
mroosmalen commented 1 year ago

This looks good to me. This doesn't work? Did you get an error, if so can you show me the error and the content of the somatic_vcfs folder?

um-pdavila commented 1 year ago

The job "completes" without error, but does not run all the nextflow scheduled process. All these processes complete with the PTATO demo dataset with the snvs and indels run options set as true. We have QC, svs, and cnvs currently set to false in the run.config.

Do we need to provide the a joint vcf file, which includes all samples, or multiple vcf files (one per sample)?

Logs

executor >  slurm (48)
[-        ] process > get_gzipped_vcfs:bgzip                                                  -
[ac/f0aa3c] process > get_gzipped_vcfs:tabix (tabix joint_output-hc.g_noChr_PASS)             [100%] 1 of 1 ✔
[-        ] process > get_indexed_bams:Index                                                  -
[dd/c65dea] process > get_indexed_bams:GetSampleName (GATK GetSampleName Oh-12453-335)        [100%] 172 of 172, cached: 172 ✔
[57/4563ba] process > get_germline_vcfs:SnpSift (SnpSift Oh-12453)                            [100%] 1 of 1 ✔
[40/29953a] process > get_germline_vcfs:bgzip (bgzip joint_output-hc.g_noChr_PASS)            [100%] 1 of 1 ✔
[79/7b5c45] process > get_germline_vcfs:tabix (tabix joint_output-hc.g_noChr_PASS)            [100%] 1 of 1 ✔
[-        ] process > short_variants:get_gzipped_vcfs:bgzip                                   -
[-        ] process > short_variants:get_gzipped_vcfs:tabix                                   -
[42/f651f0] process > short_variants:get_ab_tables:shapeit (shapeit Oh-12453 1)               [100%] 22 of 22 ✔
[5d/beadb6] process > short_variants:get_ab_tables:tabix (tabix joint_output-hc.g_noChr_PASS) [100%] 22 of 22 ✔
[-        ] process > short_variants:get_ab_tables:createABtable                              -
[-        ] process > short_variants:get_ab_tables:mergeABtable                               -
[-        ] process > short_variants:get_walker_vcfs:walker                                   -
[-        ] process > short_variants:get_walker_vcfs:bgzip                                    -
[-        ] process > short_variants:get_walker_vcfs:tabix                                    -
[-        ] process > short_variants:get_context_beds:getContext                              -
[-        ] process > short_variants:get_context_beds:sort                                    -
[-        ] process > short_variants:closest_feature:closest                                  -
[-        ] process > short_variants:closest_feature:groupby                                  -
[-        ] process > short_variants:intersect_feature:intersect                              -
[-        ] process > short_variants:intersect_feature:groupby                                -
[-        ] process > short_variants:groupby_features:intersectAll                            -
[-        ] process > short_variants:groupby_features:groupbyAll                              -
[-        ] process > short_variants:SplitVcfs                                                -
[-        ] process > short_variants:snvs:get_snvs_rf_tables:createSnvRfTable                 -
[-        ] process > short_variants:snvs:get_ptato_vcfs:test_snv_rf                          -
[-        ] process > short_variants:snvs:get_ptato_vcfs:bgzip                                -
[-        ] process > short_variants:snvs:get_ptato_vcfs:tabix                                -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoCutoff                       -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoFilter                       -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:bgzip                             -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:tabix                             -
[-        ] process > short_variants:indels:get_ptato_vcfs:excludeIndels                      -
[-        ] process > short_variants:indels:get_ptato_vcfs:bgzip                              -
[-        ] process > short_variants:indels:get_ptato_vcfs:tabix                              -
[-        ] process > short_variants:indels:filter_ptato_vcfs:ptatoIndelFilter                -
[-        ] process > short_variants:indels:filter_ptato_vcfs:bgzip                           -
[-        ] process > short_variants:indels:filter_ptato_vcfs:tabix                           -
[-        ] process > short_variants:intersect_ptato_vcfs:intersectPTATO                      -
[-        ] process > short_variants:intersect_ptato_vcfs:bgzip                               -
[-        ] process > short_variants:intersect_ptato_vcfs:tabix                               -
[-        ] process > short_variants:merge_ptato_vcfs:mergePtatoVcfs                          -
[-        ] process > short_variants:merge_ptato_vcfs:bgzip                                   -
[-        ] process > short_variants:merge_ptato_vcfs:tabix                                   -
Completed at: 13-May-2023 13:09:23
Duration    : 2h 37m 15s
CPU hours   : 156.9 (0.2% cached)
Succeeded   : 48
Cached      : 172

Somatic VCF and other inputs

[pdavila@n521 Oh-12453-si]$ tree ../Oh-12453/input/
../Oh-12453/input/
├── bam
│   └── Oh-12453
│       ├── 10-418175-23.bam
│       ├── 10-418175-23.bam.bai
│       ├── Oh-12453-197.bam
│       ├── Oh-12453-197.bam.bai
...
│       ├── Oh-12453-367.bam   (172 total)
│       └── Oh-12453-367.bam.bai
├── somatic_vcfs
│   └── Oh-12453
│       ├── joint_output-hc.g.nochr.Oh-12453-197_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-197_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-198_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-198_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-199_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-199_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-200_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-200_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-201_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-201_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-202_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-202_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-203_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-203_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-204_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-204_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-205_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-205_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-206_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-206_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-207_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-207_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-208_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-208_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-209_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-209_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-210_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-210_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-211_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-211_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-212_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-212_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-213_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-213_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-214_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-214_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-215_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-215_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-216_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-216_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-217_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-217_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-218_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-218_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-219_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-219_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-220_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-220_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-221_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-221_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-222_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-222_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-223_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-223_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-224_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-224_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-225_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-225_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-226_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-226_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-227_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-227_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-228_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-228_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-229_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-229_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-230_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-230_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-231_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-231_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-232_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-232_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-233_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-233_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-234_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-234_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-235_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-235_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-236_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-236_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-237_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-237_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-238_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-238_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-239_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-239_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-240_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-240_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-241_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-241_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-242_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-242_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-243_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-243_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-244_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-244_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-245_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-245_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-246_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-246_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-247_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-247_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-248_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-248_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-249_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-249_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-250_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-250_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-251_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-251_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-252_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-252_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-253_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-253_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-254_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-254_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-255_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-255_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-256_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-256_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-257_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-257_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-258_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-258_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-259_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-259_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-260_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-260_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-261_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-261_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-262_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-262_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-263_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-263_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-264_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-264_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-265_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-265_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-266_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-266_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-267_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-267_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-268_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-268_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-269_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-269_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-270_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-270_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-271_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-271_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-272_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-272_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-273_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-273_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-274_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-274_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-275_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-275_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-276_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-276_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-277_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-277_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-278_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-278_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-279_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-279_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-280_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-280_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-281_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-281_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-282_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-282_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-283_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-283_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-284_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-284_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-285_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-285_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-286_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-286_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-287_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-287_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-288_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-288_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-289_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-289_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-290_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-290_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-291_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-291_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-292_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-292_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-293_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-293_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-294_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-294_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-295_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-295_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-296_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-296_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-297_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-297_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-298_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-298_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-299_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-299_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-300_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-300_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-301_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-301_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-302_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-302_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-303_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-303_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-304_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-304_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-305_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-305_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-306_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-306_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-307_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-307_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-308_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-308_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-309_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-309_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-310_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-310_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-311_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-311_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-312_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-312_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-313_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-313_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-314_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-314_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-315_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-315_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-316_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-316_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-317_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-317_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-318_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-318_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-319_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-319_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-320_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-320_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-321_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-321_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-322_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-322_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-323_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-323_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-324_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-324_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-325_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-325_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-326_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-326_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-327_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-327_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-328_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-328_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-329_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-329_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-330_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-330_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-331_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-331_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-332_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-332_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-333_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-333_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-334_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-334_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-335_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-335_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-336_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-336_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-337_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-337_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-338_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-338_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-339_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-339_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-340_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-340_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-341_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-341_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-342_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-342_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-343_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-343_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-344_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-344_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-345_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-345_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-346_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-346_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-347_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-347_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-348_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-348_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-349_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-349_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-350_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-350_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-351_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-351_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-352_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-352_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-353_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-353_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-354_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-354_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-355_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-355_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-356_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-356_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-357_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-357_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-358_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-358_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-359_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-359_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-360_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-360_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-361_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-361_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-362_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-362_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-363_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-363_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-364_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-364_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-365_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-365_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-366_n.SMuRF.filtered.vcf.gz
│       ├── joint_output-hc.g.nochr.Oh-12453-366_n.SMuRF.filtered.vcf.gz.tbi
│       ├── joint_output-hc.g.nochr.Oh-12453-367_n.SMuRF.filtered.vcf.gz
│       └── joint_output-hc.g.nochr.Oh-12453-367_n.SMuRF.filtered.vcf.gz.tbi
└── vcf
    └── Oh-12453
        └── joint_output-hc.g_noChr_PASS.vcf.gz

Output

[pdavila@n521 Oh-12453-si]$ tree output/
output/
├── intermediate
│   ├── germline
│   │   └── Oh-12453
│   │       ├── joint_output-hc.g_noChr_PASS.germline.vcf.gz
│   │       └── joint_output-hc.g_noChr_PASS.germline.vcf.gz.tbi
│   └── short_variants
│       └── shapeit
│           └── Oh-12453
│               ├── joint_output-hc.g_noChr_PASS.10.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.10.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.11.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.11.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.12.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.12.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.13.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.13.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.14.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.14.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.15.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.15.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.16.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.16.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.17.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.17.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.18.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.18.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.19.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.19.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.1.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.1.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.20.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.20.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.21.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.21.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.22.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.22.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.2.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.2.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.3.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.3.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.4.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.4.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.5.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.5.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.6.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.6.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.7.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.7.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.8.phased.vcf.gz
│               ├── joint_output-hc.g_noChr_PASS.8.phased.vcf.gz.tbi
│               ├── joint_output-hc.g_noChr_PASS.9.phased.vcf.gz
│               └── joint_output-hc.g_noChr_PASS.9.phased.vcf.gz.tbi
└── log
    ├── nextflow_report.html
    ├── nextflow_timeline.html
    ├── nextflow_trace.txt
    └── slurm
        ├── 12605.err
        ├── 12605.out
um-pdavila commented 1 year ago

Hi Dr. Roosmalen,

The demo_src your lab provided has been invaluable for us. Thank you all so much for your work, time, and help.

After modifying the somatic vcf file names to match the filenames expected in the PTATO code, we progressed further with PTATO (run 2023/05/16). We got a new error with the 'short_variants:get_walker_vcfs:walker (walker joint_output-hc.g.nochr_Oh-12453-205).

We were able to get pass this error by removing our normal sample from our somatic files (current run 2023/05/17). :)

Details on both these runs below, posting in case it help others. We used the NextFlow resume option for both. PTATO is re-running the short_variants process on the current run, as we expected.

(run 2023/05/16)

executor >  slurm (6329)
[-        ] process > get_gzipped_vcfs:bgzip                                                                                       -
[6e/1f22f4] process > get_gzipped_vcfs:tabix (tabix joint_output-hc.g_noChr_PASS)                                                  [100%] 1 of 1 ✔
[-        ] process > get_indexed_bams:Index                                                                                       -
[60/16ee1a] process > get_indexed_bams:GetSampleName (GATK GetSampleName Oh-12453-335)                                             [100%] 172 of 172 ✔
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:bgzip                                                                     -
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:tabix                                                                     -
[-        ] process > short_variants:get_gzipped_vcfs:bgzip                                                                        -
[-        ] process > short_variants:get_gzipped_vcfs:tabix                                                                        -
[3b/f54472] process > short_variants:get_ab_tables:createABtable (createABtable joint_output-hc.g.nochr_Oh-12453-294 4)            [100%] 3762 of 3762 ✔
[-        ] process > short_variants:get_ab_tables:mergeABtable (mergeABtable joint_output-hc.g.nochr_Oh-12453-268)                -
[89/93ff97] process > short_variants:get_walker_vcfs:walker (walker joint_output-hc.g.nochr_Oh-12453-259)                          [350%] 14 of 4, failed: 14
[-        ] process > short_variants:get_walker_vcfs:bgzip                                                                         -
[-        ] process > short_variants:get_walker_vcfs:tabix                                                                         -
[a6/15b99c] process > short_variants:get_context_beds:getContext (getContext joint_output-hc.g.nochr_Oh-12453-333)                 [100%] 171 of 171 ✔
[e3/d73473] process > short_variants:get_context_beds:sort (bedtoolsSort joint_output-hc.g.nochr_Oh-12453-333)                     [100%] 171 of 171 ✔
[20/756a86] process > short_variants:closest_feature:closest (bedtoolsClosest joint_output-hc.g.nochr_Oh-12453-333 SIMPLEREPEAT)   [100%] 342 of 342 ✔
[36/059aab] process > short_variants:closest_feature:groupby (bedtoolsGroupby joint_output-hc.g.nochr_Oh-12453-333 SIMPLEREPEAT)   [100%] 301 of 301
[7c/97fcdd] process > short_variants:intersect_feature:intersect (bedtoolsIntersect joint_output-hc.g.nochr_Oh-12453-333 REPLISEQ) [100%] 342 of 342 ✔
[97/006eed] process > short_variants:intersect_feature:groupby (bedtoolsGroupby joint_output-hc.g.nochr_Oh-12453-333 REPLISEQ)     [100%] 300 of 300
[-        ] process > short_variants:groupby_features:intersectAll                                                                 -
[-        ] process > short_variants:groupby_features:groupbyAll                                                                   -
[1a/8c3f6b] process > short_variants:SplitVcfs (GATK SplitVcfs joint_output-hc.g.nochr_Oh-12453-329)                               [100%] 171 of 171 ✔
[-        ] process > short_variants:snvs:get_snvs_rf_tables:createSnvRfTable                                                      -
[-        ] process > short_variants:snvs:get_ptato_vcfs:test_snv_rf                                                               -
[-        ] process > short_variants:snvs:get_ptato_vcfs:bgzip                                                                     -
[-        ] process > short_variants:snvs:get_ptato_vcfs:tabix                                                                     -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoCutoff                                                            -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoFilter                                                            -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:bgzip                                                                  -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:tabix                                                                  -
[-        ] process > short_variants:indels:get_ptato_vcfs:excludeIndels (excludeIndels joint_output-hc.g.nochr_Oh-12453-333)      -
[-        ] process > short_variants:indels:get_ptato_vcfs:bgzip                                                                   -
[-        ] process > short_variants:indels:get_ptato_vcfs:tabix                                                                   -
[-        ] process > short_variants:indels:filter_ptato_vcfs:ptatoIndelFilter                                                     -
[-        ] process > short_variants:indels:filter_ptato_vcfs:bgzip                                                                -
[-        ] process > short_variants:indels:filter_ptato_vcfs:tabix                                                                -
[-        ] process > short_variants:intersect_ptato_vcfs:intersectPTATO                                                           -
[-        ] process > short_variants:intersect_ptato_vcfs:bgzip                                                                    -
[-        ] process > short_variants:intersect_ptato_vcfs:tabix                                                                    -
[-        ] process > short_variants:merge_ptato_vcfs:mergePtatoVcfs                                                               -
[-        ] process > short_variants:merge_ptato_vcfs:bgzip                                                                        -
[-        ] process > short_variants:merge_ptato_vcfs:tabix                                                                        -
ERROR ~ Error executing process > 'short_variants:get_walker_vcfs:walker (walker joint_output-hc.g.nochr_Oh-12453-205)'

Caused by:
  Process `short_variants:get_walker_vcfs:walker (walker joint_output-hc.g.nochr_Oh-12453-205)` terminated with an error exit status (140)

The "[350%] 14 of 4, failed: 14" is concerning, not sure why there were 14 of 4 runs yet.

(run 2023/05/17)

...
germline_vcfs_dir = 'path_to/output/intermediate/germline'

short_variants {
 somatic_vcfs_dir = 'path_to/output/intermediate/short_variants/somatic_vcfs'
 walker_vcfs_dir = ''
 phased_vcfs_dir = 'path_to/output/intermediate/short_variants/shapeit'
 ab_tables_dir = ''
 context_beds_dir = ''
 features_beds_dir = ''
}
...
executor >  slurm (4404)
[-        ] process > get_gzipped_vcfs:bgzip                                                                             -
[6e/1f22f4] process > get_gzipped_vcfs:tabix (tabix joint_output-hc.g_noChr_PASS)                                        [100%] 1 of 1, cached: 1 ✔
[-        ] process > get_indexed_bams:Index                                                                             -
[c4/c965c4] process > get_indexed_bams:GetSampleName (GATK GetSampleName Oh-12453-340)                                   [100%] 172 of 172, cached: 172 ✔
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:bgzip                                                           -
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:tabix                                                           -
[-        ] process > short_variants:get_gzipped_vcfs:bgzip                                                              -
[-        ] process > short_variants:get_gzipped_vcfs:tabix                                                              -
[dc/04cf23] process > short_variants:get_ab_tables:createABtable (createABtable joint_output-hc.g.nochr_Oh-12453-360 11) [  2%] 79 of 3762
[-        ] process > short_variants:get_ab_tables:mergeABtable                                                          -
[a7/1c4e3f] process > short_variants:get_walker_vcfs:walker (walker joint_output-hc.g.nochr_Oh-12453-310)                [  0%] 0 of 171
[-        ] process > short_variants:get_walker_vcfs:bgzip                                                               -
[-        ] process > short_variants:get_walker_vcfs:tabix                                                               -
[7c/be1547] process > short_variants:get_context_beds:getContext (getContext joint_output-hc.g.nochr_Oh-12453-254)       [ 75%] 129 of 171
[46/936555] process > short_variants:get_context_beds:sort (bedtoolsSort joint_output-hc.g.nochr_Oh-12453-254)           [  0%] 0 of 129
[-        ] process > short_variants:closest_feature:closest                                                             -
[-        ] process > short_variants:closest_feature:groupby                                                             -
[-        ] process > short_variants:intersect_feature:intersect                                                         -
[-        ] process > short_variants:intersect_feature:groupby                                                           -
[-        ] process > short_variants:groupby_features:intersectAll                                                       -
[-        ] process > short_variants:groupby_features:groupbyAll                                                         -
[82/6183c9] process > short_variants:SplitVcfs (GATK SplitVcfs joint_output-hc.g.nochr_Oh-12453-322)                     [ 77%] 133 of 171
[-        ] process > short_variants:snvs:get_snvs_rf_tables:createSnvRfTable                                            -
[-        ] process > short_variants:snvs:get_ptato_vcfs:test_snv_rf                                                     -
[-        ] process > short_variants:snvs:get_ptato_vcfs:bgzip                                                           -
[-        ] process > short_variants:snvs:get_ptato_vcfs:tabix                                                           -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoCutoff                                                  -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoFilter                                                  -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:bgzip                                                        -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:tabix                                                        -
[-        ] process > short_variants:indels:get_ptato_vcfs:excludeIndels                                                 -
[-        ] process > short_variants:indels:get_ptato_vcfs:bgzip                                                         -
[-        ] process > short_variants:indels:get_ptato_vcfs:tabix                                                         -
[-        ] process > short_variants:indels:filter_ptato_vcfs:ptatoIndelFilter                                           -
[-        ] process > short_variants:indels:filter_ptato_vcfs:bgzip                                                      -
[-        ] process > short_variants:indels:filter_ptato_vcfs:tabix                                                      -
[-        ] process > short_variants:intersect_ptato_vcfs:intersectPTATO                                                 -
[-        ] process > short_variants:intersect_ptato_vcfs:bgzip                                                          -
[-        ] process > short_variants:intersect_ptato_vcfs:tabix                                                          -
[-        ] process > short_variants:merge_ptato_vcfs:mergePtatoVcfs                                                     -
[-        ] process > short_variants:merge_ptato_vcfs:bgzip                                                              -
[-        ] process > short_variants:merge_ptato_vcfs:tabix                                                              -

Best, Pedro

mroosmalen commented 1 year ago

Hi Pedro

Thank you for sharing your code and knowledge.

um-pdavila commented 1 year ago

Hi Dr. Roosmalen,

All but 29 of the walker sample processes, and final mergePtatoVcfs process have completed. : )

Can you confirm the short_variants[snvs, indels, and intersect] processes require the output of get_walker_vcfs:walker? We think they do, and there is a bug with the Pipeline caused when one of the child short_variants processes failed earlier today.

We also think we may need to set the walker_vcfs_dir and rerun the NextFlow pipeline after all 171 walker jobs complete.

    short_variants {
      somatic_vcfs_dir = '/sccc/scratch/fxd317/DATA/bioskryb_3-6-2023/PTATO_NEW/output/intermediate/short_variants/somatic_vcfs'
      walker_vcfs_dir = ''
      phased_vcfs_dir = '/sccc/scratch/fxd317/DATA/bioskryb_3-6-2023/PTATO_NEW/output/intermediate/short_variants/shapeit'
      ab_tables_dir = ''
      context_beds_dir = ''
      features_beds_dir = ''
    }

Below is the status of resuming the PTATO job.

executor >  slurm (4021)
[-        ] process > get_gzipped_vcfs:bgzip                                                                                           -
[6e/1f22f4] process > get_gzipped_vcfs:tabix (tabix joint_output-hc.g_noChr_PASS)                                                      [100%] 1 of 1, cached: 1 ✔
[-        ] process > get_indexed_bams:Index                                                                                           -
[a7/160a7c] process > get_indexed_bams:GetSampleName (GATK GetSampleName Oh-12453-306)                                                 [100%] 172 of 172, cached: 172 ✔
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:bgzip                                                                         -
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:tabix                                                                         -
[-        ] process > short_variants:get_gzipped_vcfs:bgzip                                                                            -
[-        ] process > short_variants:get_gzipped_vcfs:tabix                                                                            -
[a5/23b964] process > short_variants:get_ab_tables:createABtable (createABtable joint_output-hc.g.nochr_Oh-12453-201 2)                [100%] 3762 of 3762, cached: 3762 ✔
[22/87e117] process > short_variants:get_ab_tables:mergeABtable (mergeABtable joint_output-hc.g.nochr_Oh-12453-303)                    [100%] 171 of 171, cached: 171 ✔
[27/28811c] process > short_variants:get_walker_vcfs:walker (walker joint_output-hc.g.nochr_Oh-12453-250)                              [ 83%] 142 of 171, cached: 142
[71/4ab2e7] process > short_variants:get_walker_vcfs:bgzip (bgzip joint_output-hc.g.nochr_Oh-12453-326)                                [100%] 142 of 142, cached: 142
[d6/8e0883] process > short_variants:get_walker_vcfs:tabix (tabix joint_output-hc.g.nochr_Oh-12453-326)                                [100%] 142 of 142, cached: 142
[d6/ef7bc2] process > short_variants:get_context_beds:getContext (getContext joint_output-hc.g.nochr_Oh-12453-216)                     [100%] 171 of 171, cached: 171 ✔
[bb/c779ac] process > short_variants:get_context_beds:sort (bedtoolsSort joint_output-hc.g.nochr_Oh-12453-303)                         [100%] 171 of 171, cached: 171 ✔
[d9/1b6c08] process > short_variants:closest_feature:closest (bedtoolsClosest joint_output-hc.g.nochr_Oh-12453-331 SIMPLEREPEAT)       [100%] 342 of 342 ✔
[4a/71b744] process > short_variants:closest_feature:groupby (bedtoolsGroupby joint_output-hc.g.nochr_Oh-12453-331 SIMPLEREPEAT)       [100%] 342 of 342 ✔
[0f/1e4284] process > short_variants:intersect_feature:intersect (bedtoolsIntersect joint_output-hc.g.nochr_Oh-12453-331 REPLISEQ)     [100%] 342 of 342 ✔
[a9/1f35d0] process > short_variants:intersect_feature:groupby (bedtoolsGroupby joint_output-hc.g.nochr_Oh-12453-331 REPLISEQ)         [100%] 342 of 342 ✔
[a5/ce832b] process > short_variants:groupby_features:intersectAll (bedtoolsIntersectAll joint_output-hc.g.nochr_Oh-12453-351)         [100%] 171 of 171 ✔
[c1/028281] process > short_variants:groupby_features:groupbyAll (bedtoolsGroupbyAll joint_output-hc.g.nochr_Oh-12453-351)             [100%] 171 of 171 ✔
[f6/f40023] process > short_variants:SplitVcfs (GATK SplitVcfs joint_output-hc.g.nochr_Oh-12453-236)                                   [100%] 171 of 171, cached: 171 ✔
[0f/0cf7d0] process > short_variants:snvs:get_snvs_rf_tables:createSnvRfTable (createSnvRfTable joint_output-hc.g.nochr_Oh-12453-351)  [100%] 171 of 171 ✔
[74/33dd33] process > short_variants:snvs:get_ptato_vcfs:test_snv_rf (test_snv_rf joint_output-hc.g.nochr_Oh-12453-351)                [100%] 171 of 171 ✔
[d0/bc76a9] process > short_variants:snvs:get_ptato_vcfs:bgzip (bgzip joint_output-hc.g.nochr_Oh-12453-351)                            [100%] 171 of 171 ✔
[42/24f243] process > short_variants:snvs:get_ptato_vcfs:tabix (tabix joint_output-hc.g.nochr_Oh-12453-351)                            [100%] 171 of 171 ✔
[09/270114] process > short_variants:snvs:filter_ptato_vcfs:ptatoCutoff (ptatoCutoff joint_output-hc.g.nochr_Oh-12453-321)             [100%] 142 of 142
[86/c8c1cb] process > short_variants:snvs:filter_ptato_vcfs:ptatoFilter (ptatoFilter joint_output-hc.g.nochr_Oh-12453-321)             [100%] 142 of 142
[d9/b3b7d3] process > short_variants:snvs:filter_ptato_vcfs:bgzip (bgzip joint_output-hc.g.nochr_Oh-12453-321)                         [100%] 142 of 142
[fd/864a79] process > short_variants:snvs:filter_ptato_vcfs:tabix (tabix joint_output-hc.g.nochr_Oh-12453-321)                         [100%] 142 of 142
[44/5f6d79] process > short_variants:indels:get_ptato_vcfs:excludeIndels (excludeIndels joint_output-hc.g.nochr_Oh-12453-331)          [100%] 171 of 171 ✔
[62/8de235] process > short_variants:indels:get_ptato_vcfs:bgzip (bgzip joint_output-hc.g.nochr_Oh-12453-331)                          [100%] 171 of 171 ✔
[15/aefaca] process > short_variants:indels:get_ptato_vcfs:tabix (tabix joint_output-hc.g.nochr_Oh-12453-331)                          [100%] 171 of 171 ✔
[1a/e0238a] process > short_variants:indels:filter_ptato_vcfs:ptatoIndelFilter (ptatoIndelFilter joint_output-hc.g.nochr_Oh-12453-331) [100%] 171 of 171 ✔
[5d/6affbd] process > short_variants:indels:filter_ptato_vcfs:bgzip (bgzip joint_output-hc.g.nochr_Oh-12453-331)                       [100%] 171 of 171 ✔
[d2/842399] process > short_variants:indels:filter_ptato_vcfs:tabix (tabix joint_output-hc.g.nochr_Oh-12453-331)                       [100%] 171 of 171 ✔
[79/562a40] process > short_variants:intersect_ptato_vcfs:intersectPTATO (bedtoolsIntersectPTATO Oh-12453)                             [100%] 1 of 1 ✔
[a6/92db04] process > short_variants:intersect_ptato_vcfs:bgzip (bgzip Oh-12453)                                                       [100%] 1 of 1 ✔
[28/82ab63] process > short_variants:intersect_ptato_vcfs:tabix (tabix Oh-12453)                                                       [100%] 1 of 1 ✔
[a9/96d547] process > short_variants:merge_ptato_vcfs:mergePtatoVcfs (mergePtatoVcfs Oh-12453)                                         [  0%] 0 of 1
[-        ] process > short_variants:merge_ptato_vcfs:bgzip                                                                            -
[-        ] process > short_variants:merge_ptato_vcfs:tabix                                                                            -

Note: We increased the PTATO default ram and run time for the short_variants:intersect_ptato_vcfs process from 10G to 100G and 10min to 100min (over kill, I know). 10G was not enough ram with 172 samples, which caused the pipeline to fail.

The walker jobs originally showed at 100% complete (with only 141 samples) earlier this morning when the job also failed.

mroosmalen commented 1 year ago

This is the next step that needs the output of the walker jobs:

short_variants:snvs:filter_ptato_vcfs:ptatoCutoff

um-pdavila commented 1 year ago

Thank you Dr. Roosmalen,

We were able to complete the remaining walker samples using Walker.py outside of PTATO. We then modified the run.config, passing the computed gremlin and all the short_variants. The job completes but many processes do not run. I'm guessing we need to merge the walker sample vcf's like we did with SMuRF. Is that correct?

optional {

  germline_vcfs_dir = '/path/to/output/intermediate/germline'

  short_variants {
   somatic_vcfs_dir = '/path/to/output/intermediate/short_variants/somatic_vcfs'
   walker_vcfs_dir = '/path/to/output/intermediate/short_variants/walker'
   phased_vcfs_dir = '/path/to/output/intermediate/short_variants/shapeit'
   ab_tables_dir = '/path/to/output/intermediate/short_variants/ab'
   context_beds_dir = '/path/to/output/intermediate/short_variants/context'
   features_beds_dir = '/path/to/output/intermediate/short_variants/features'
  }
executor >  slurm (1084)
[-        ] process > get_gzipped_vcfs:bgzip                                                                                           -
[c8/8ea855] process > get_gzipped_vcfs:tabix (tabix joint_output-hc.g_noChr_PASS)                                                      [100%] 1 of 1 ✔
[-        ] process > get_indexed_bams:Index                                                                                           -
[0b/cad156] process > get_indexed_bams:GetSampleName (GATK GetSampleName Oh-12453-335)                                                 [100%] 172 of 172, cached: 172 ✔
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:bgzip                                                                         -
[-        ] process > get_germline_vcfs:get_gzipped_vcfs:tabix                                                                         -
[-        ] process > short_variants:get_gzipped_vcfs:bgzip                                                                            -
[-        ] process > short_variants:get_gzipped_vcfs:tabix                                                                            -
[-        ] process > short_variants:get_gzipped_vcfs2:bgzip                                                                           -
[-        ] process > short_variants:get_gzipped_vcfs2:tabix                                                                           -
[bd/63ed7d] process > short_variants:SplitVcfs (GATK SplitVcfs joint_output-hc.g.nochr_Oh-12453-241)                                   [100%] 171 of 171, cached: 114 ✔
[-        ] process > short_variants:snvs:get_snvs_rf_tables:createSnvRfTable                                                          -
[-        ] process > short_variants:snvs:get_ptato_vcfs:test_snv_rf                                                                   -
[-        ] process > short_variants:snvs:get_ptato_vcfs:bgzip                                                                         -
[-        ] process > short_variants:snvs:get_ptato_vcfs:tabix                                                                         -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoCutoff                                                                -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:ptatoFilter                                                                -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:bgzip                                                                      -
[-        ] process > short_variants:snvs:filter_ptato_vcfs:tabix                                                                      -
[82/12b0f0] process > short_variants:indels:get_ptato_vcfs:excludeIndels (excludeIndels joint_output-hc.g.nochr_Oh-12453-241)          [100%] 171 of 171 ✔
[7d/a1eab9] process > short_variants:indels:get_ptato_vcfs:bgzip (bgzip joint_output-hc.g.nochr_Oh-12453-241)                          [100%] 171 of 171 ✔
[f7/0c6d97] process > short_variants:indels:get_ptato_vcfs:tabix (tabix joint_output-hc.g.nochr_Oh-12453-241)                          [100%] 171 of 171 ✔
[ba/ba2b10] process > short_variants:indels:filter_ptato_vcfs:ptatoIndelFilter (ptatoIndelFilter joint_output-hc.g.nochr_Oh-12453-241) [100%] 171 of 171 ✔
[3b/6562d0] process > short_variants:indels:filter_ptato_vcfs:bgzip (bgzip joint_output-hc.g.nochr_Oh-12453-241)                       [100%] 171 of 171 ✔
[33/39fa00] process > short_variants:indels:filter_ptato_vcfs:tabix (tabix joint_output-hc.g.nochr_Oh-12453-241)                       [100%] 171 of 171 ✔
[-        ] process > short_variants:intersect_ptato_vcfs:intersectPTATO                                                               -
[-        ] process > short_variants:intersect_ptato_vcfs:bgzip                                                                        -
[-        ] process > short_variants:intersect_ptato_vcfs:tabix                                                                        -
[-        ] process > short_variants:merge_ptato_vcfs:mergePtatoVcfs                                                                   -
[-        ] process > short_variants:merge_ptato_vcfs:bgzip                                                                            -
[-        ] process > short_variants:merge_ptato_vcfs:tabix                                                                            -
Completed at: 26-May-2023 11:37:27
Duration    : 7m 42s
CPU hours   : 2.1 (23.8% cached)
Succeeded   : 1'084
Cached      : 286
mroosmalen commented 1 year ago

The next step [- ] process > short_variants:snvs:get_snvs_rf_tables:createSnvRfTable needs the following input ab_tables and features_beds (line 18 in snvs.nf). There was a bug in with getting the names of the features bed files from the optional parameter. It was still looking for (the old) merged.bed files in stead of (the new) groupby.bed. Can you please try again with the newest version?