icbi-lab / nextNEOpi

nextNEOpi: a comprehensive pipeline for computational neoantigen prediction
Other
65 stars 23 forks source link

Error executing process > 'gatherMutect2VCFs (sample2)' #53

Closed amoschoomy closed 8 months ago

amoschoomy commented 9 months ago

Hello again, now the pipeline is getting this error:

executor >  local (821)
[-        ] process > merge_fastq                    -
[44/a23509] process > RegionsBedToIntervalList (R... [100%] 1 of 1, cached: 1 ✔
[fe/1d4ae2] process > BaitsBedToIntervalList (Bai... [100%] 1 of 1, cached: 1 ✔
[5e/263dc4] process > preprocessIntervalList (pre... [100%] 1 of 1, cached: 1 ✔
[11/95224e] process > SplitIntervals (SplitInterv... [100%] 1 of 1 ✔
[aa/41e120] process > IntervalListToBed (BedFromI... [100%] 1 of 1, cached: 1 ✔
[15/e07cf2] process > ScatteredIntervalListToBed ... [100%] 40 of 40 ✔
[73/d08e65] process > FastQC (sample2 : normal_DNA)  [100%] 6 of 6, cached: 6 ✔
[72/921d8d] process > fastp (sample1 : normal_DNA)   [100%] 6 of 6, cached: 6 ✔
[76/a4856d] process > FastQC_trimmed (sample1 : n... [100%] 6 of 6, cached: 6 ✔
[db/6b4ce8] process > make_uBAM (sample1 : tumor_... [100%] 4 of 4 ✔
[7f/d6ef3b] process > Bwa (sample1 : normal_DNA)     [100%] 4 of 4 ✔
[a2/8dc9f3] process > merge_uBAM_BAM (sample1 : n... [100%] 4 of 4 ✔
[87/debcda] process > MarkDuplicates (sample1 : t... [100%] 4 of 4 ✔
[9e/8ca93a] process > alignmentMetrics (sample1 :... [100%] 4 of 4 ✔
[18/10cd02] process > scatterBaseRecalGATK4 (samp... [100%] 160 of 160 ✔
[15/d3c28d] process > gatherGATK4scsatteredBQSRta... [100%] 4 of 4 ✔
[61/824d8b] process > scatterGATK4applyBQSRS (sam... [100%] 160 of 160 ✔
[5b/97632f] process > GatherRecalBamFiles (sample... [100%] 4 of 4 ✔
[93/ecf78f] process > GetPileup (sample1 : tumor_... [100%] 4 of 4 ✔
[1e/4fdf9c] process > Mutect2 (sample1)              [100%] 80 of 80 ✔
[32/4f5f39] process > gatherMutect2VCFs (sample2)    [  0%] 0 of 2
[-        ] process > FilterMutect2                  -
[dd/bb6fd8] process > HaploTypeCaller (sample2)      [100%] 80 of 80 ✔
[4d/c359f2] process > CNNScoreVariants (sample2)     [100%] 80 of 80 ✔
[c6/485f54] process > MergeHaploTypeCallerGermlin... [100%] 2 of 2 ✔
[6e/c5352d] process > FilterGermlineVariantTranch... [100%] 2 of 2 ✔
[fa/4e56ca] process > IndelRealignerIntervals (sa... [100%] 160 of 160 ✔
[a4/70f418] process > GatherRealignedBamFiles (sa... [  0%] 0 of 4
[-        ] process > VarscanSomaticScattered        -
[-        ] process > gatherVarscanVCFs              -
[-        ] process > ProcessVarscan                 -
[-        ] process > FilterVarscan                  -
[-        ] process > MergeAndRenameSamplesInVars... -
[69/2cda8b] process > MantaSomaticIndels (sample1)   [  0%] 0 of 2
[-        ] process > StrelkaSomatic                 -
[-        ] process > finalizeStrelkaVCF             -
[-        ] process > mkHCsomaticVCF                 -
[-        ] process > VepTab                         -
[-        ] process > mkCombinedVCF                  -
[-        ] process > VEPvcf                         -
[-        ] process > ReadBackedphasing              -
[-        ] process > AlleleCounter                  -
[-        ] process > ConvertAlleleCounts            -
[-        ] process > Ascat                          -
[-        ] process > SequenzaUtils                  -
[-        ] process > gatherSequenzaInput            -
[-        ] process > Sequenza                       -
[d8/ddb69d] process > make_CNVkit_access_file (mk... [100%] 1 of 1, cached: 1 ✔
[-        ] process > CNVkit                         -
[-        ] process > Clonality                      -
[-        ] process > MutationalBurden               -
[-        ] process > MutationalBurdenCoding         -
[c9/0a8c38] process > mhc_extract (sample1)          [100%] 2 of 2 ✔
[61/02d086] process > pre_map_hla (sample1)          [100%] 2 of 2 ✔
[d2/d9df9d] process > OptiType (sample1)             [100%] 2 of 2 ✔
[00/6f1a3c] process > pre_map_hla_RNA (sample1)      [100%] 2 of 2, cached: 2 ✔
[90/95cd23] process > OptiType_RNA (sample1)         [100%] 2 of 2, cached: 2 ✔
[55/8407cb] process > run_hla_hd (sample1)           [100%] 2 of 2 ✔
[2e/3948a1] process > get_vhla (sample1)             [100%] 2 of 2 ✔
[-        ] process > Neofuse                        -
[-        ] process > publish_NeoFuse                -
[-        ] process > add_geneID                     -
[-        ] process > gene_annotator                 -
[-        ] process > pVACseq                        -
[-        ] process > concat_pVACseq_files           -
[-        ] process > aggregated_reports             -
[-        ] process > pVACtools_generate_protein_seq -
[-        ] process > pepare_mixMHC2_seq             -
[-        ] process > mixMHC2pred                    -
[-        ] process > addCCF                         -
[-        ] process > make_epitopes_fasta            -
[-        ] process > blast_epitopes                 -
[-        ] process > add_blast_hits                 -
[-        ] process > csin                           -
[-        ] process > immunogenicity_scoring         -
[94/155f86] process > mixcr (sample1 : tumor_RNA)    [100%] 6 of 6 ✔
[-        ] process > collectSampleInfo              -
[cb/7c36bf] process > multiQC (sample1)              [100%] 2 of 2 ✔
Error executing process > 'gatherMutect2VCFs (sample2)'

Caused by:
  Process `gatherMutect2VCFs (sample2)` terminated with an error exit status (2)

Command executed:

  mkdir -p /scratch/temp/5859185

  gatk --java-options "-Xmx32G -XX:ParallelGCThreads=1" MergeVcfs \
      --TMP_DIR /scratch/temp/5859185 \
      -I sample2_0001-scattered.interval_list.vcf.gz -I sample2_0033-scattered.interval_list.vcf.gz -I sample2_0016-scattered.interval_list.vcf.gz -I sample2_0000-scattered.interval_list.vcf.gz -I sample2_0031-scattered.interval_list.vcf.gz -I sample2_0002-scattered.interval_list.vcf.gz -I sample2_0038-scattered.interval_list.vcf.gz -I sample2_0023-scattered.interval_list.vcf.gz -I sample2_0011-scattered.interval_list.vcf.gz -I sample2_0037-scattered.interval_list.vcf.gz -I sample2_0030-scattered.interval_list.vcf.gz -I sample2_0012-scattered.interval_list.vcf.gz -I sample2_0032-scattered.interval_list.vcf.gz -I sample2_0014-scattered.interval_list.vcf.gz -I sample2_0007-scattered.interval_list.vcf.gz -I sample2_0015-scattered.interval_list.vcf.gz -I sample2_0028-scattered.interval_list.vcf.gz -I sample2_0005-scattered.interval_list.vcf.gz -I sample2_0008-scattered.interval_list.vcf.gz -I sample2_0027-scattered.interval_list.vcf.gz -I sample2_0004-scattered.interval_list.vcf.gz -I sample2_0021-scattered.interval_list.vcf.gz -I sample2_0003-scattered.interval_list.vcf.gz -I sample2_0035-scattered.interval_list.vcf.gz -I sample2_0018-scattered.interval_list.vcf.gz -I sample2_0020-scattered.interval_list.vcf.gz -I sample2_0017-scattered.interval_list.vcf.gz -I sample2_0019-scattered.interval_list.vcf.gz -I sample2_0039-scattered.interval_list.vcf.gz -I sample2_0013-scattered.interval_list.vcf.gz -I sample2_0029-scattered.interval_list.vcf.gz -I sample2_0024-scattered.interval_list.vcf.gz -I sample2_0036-scattered.interval_list.vcf.gz -I sample2_0006-scattered.interval_list.vcf.gz -I sample2_0025-scattered.interval_list.vcf.gz -I sample2_0026-scattered.interval_list.vcf.gz -I sample2_0010-scattered.interval_list.vcf.gz -I sample2_0009-scattered.interval_list.vcf.gz -I sample2_0022-scattered.interval_list.vcf.gz -I sample2_0034-scattered.interval_list.vcf.gz \
      -O sample2_mutect2_raw.vcf.gz

  gatk MergeMutectStats \
      --tmp-dir /scratch/temp/5859185 \
      --stats sample2_0001-scattered.interval_list.vcf.gz.stats --stats sample2_0033-scattered.interval_list.vcf.gz.stats --stats sample2_0016-scattered.interval_list.vcf.gz.stats --stats sample2_0000-scattered.interval_list.vcf.gz.stats --stats sample2_0031-scattered.interval_list.vcf.gz.stats --stats sample2_0002-scattered.interval_list.vcf.gz.stats --stats sample2_0038-scattered.interval_list.vcf.gz.stats --stats sample2_0023-scattered.interval_list.vcf.gz.stats --stats sample2_0011-scattered.interval_list.vcf.gz.stats --stats sample2_0037-scattered.interval_list.vcf.gz.stats --stats sample2_0030-scattered.interval_list.vcf.gz.stats --stats sample2_0012-scattered.interval_list.vcf.gz.stats --stats sample2_0032-scattered.interval_list.vcf.gz.stats --stats sample2_0014-scattered.interval_list.vcf.gz.stats --stats sample2_0007-scattered.interval_list.vcf.gz.stats --stats sample2_0015-scattered.interval_list.vcf.gz.stats --stats sample2_0028-scattered.interval_list.vcf.gz.stats --stats sample2_0005-scattered.interval_list.vcf.gz.stats --stats sample2_0008-scattered.interval_list.vcf.gz.stats --stats sample2_0027-scattered.interval_list.vcf.gz.stats --stats sample2_0004-scattered.interval_list.vcf.gz.stats --stats sample2_0021-scattered.interval_list.vcf.gz.stats --stats sample2_0003-scattered.interval_list.vcf.gz.stats --stats sample2_0035-scattered.interval_list.vcf.gz.stats --stats sample2_0018-scattered.interval_list.vcf.gz.stats --stats sample2_0020-scattered.interval_list.vcf.gz.stats --stats sample2_0017-scattered.interval_list.vcf.gz.stats --stats sample2_0019-scattered.interval_list.vcf.gz.stats --stats sample2_0039-scattered.interval_list.vcf.gz.stats --stats sample2_0013-scattered.interval_list.vcf.gz.stats --stats sample2_0029-scattered.interval_list.vcf.gz.stats --stats sample2_0024-scattered.interval_list.vcf.gz.stats --stats sample2_0036-scattered.interval_list.vcf.gz.stats --stats sample2_0006-scattered.interval_list.vcf.gz.stats --stats sample2_0025-scattered.interval_list.vcf.gz.stats --stats sample2_0026-scattered.interval_list.vcf.gz.stats --stats sample2_0010-scattered.interval_list.vcf.gz.stats --stats sample2_0009-scattered.interval_list.vcf.gz.stats --stats sample2_0022-scattered.interval_list.vcf.gz.stats --stats sample2_0034-scattered.interval_list.vcf.gz.stats \
      -O sample2_mutect2_raw.vcf.gz.stats

  gatk LearnReadOrientationModel \
      --tmp-dir /scratch/temp/5859185 \
      -I sample2_0001-scattered.interval_list-f1r2.tar.gz -I sample2_0033-scattered.interval_list-f1r2.tar.gz -I sample2_0016-scattered.interval_list-f1r2.tar.gz -I sample2_0000-scattered.interval_list-f1r2.tar.gz -I sample2_0031-scattered.interval_list-f1r2.tar.gz -I sample2_0002-scattered.interval_list-f1r2.tar.gz -I sample2_0038-scattered.interval_list-f1r2.tar.gz -I sample2_0023-scattered.interval_list-f1r2.tar.gz -I sample2_0011-scattered.interval_list-f1r2.tar.gz -I sample2_0037-scattered.interval_list-f1r2.tar.gz -I sample2_0030-scattered.interval_list-f1r2.tar.gz -I sample2_0012-scattered.interval_list-f1r2.tar.gz -I sample2_0032-scattered.interval_list-f1r2.tar.gz -I sample2_0014-scattered.interval_list-f1r2.tar.gz -I sample2_0007-scattered.interval_list-f1r2.tar.gz -I sample2_0015-scattered.interval_list-f1r2.tar.gz -I sample2_0028-scattered.interval_list-f1r2.tar.gz -I sample2_0005-scattered.interval_list-f1r2.tar.gz -I sample2_0008-scattered.interval_list-f1r2.tar.gz -I sample2_0027-scattered.interval_list-f1r2.tar.gz -I sample2_0004-scattered.interval_list-f1r2.tar.gz -I sample2_0021-scattered.interval_list-f1r2.tar.gz -I sample2_0003-scattered.interval_list-f1r2.tar.gz -I sample2_0035-scattered.interval_list-f1r2.tar.gz -I sample2_0018-scattered.interval_list-f1r2.tar.gz -I sample2_0020-scattered.interval_list-f1r2.tar.gz -I sample2_0017-scattered.interval_list-f1r2.tar.gz -I sample2_0019-scattered.interval_list-f1r2.tar.gz -I sample2_0039-scattered.interval_list-f1r2.tar.gz -I sample2_0013-scattered.interval_list-f1r2.tar.gz -I sample2_0029-scattered.interval_list-f1r2.tar.gz -I sample2_0024-scattered.interval_list-f1r2.tar.gz -I sample2_0036-scattered.interval_list-f1r2.tar.gz -I sample2_0006-scattered.interval_list-f1r2.tar.gz -I sample2_0025-scattered.interval_list-f1r2.tar.gz -I sample2_0026-scattered.interval_list-f1r2.tar.gz -I sample2_0010-scattered.interval_list-f1r2.tar.gz -I sample2_0009-scattered.interval_list-f1r2.tar.gz -I sample2_0022-scattered.interval_list-f1r2.tar.gz -I sample2_0034-scattered.interval_list-f1r2.tar.gz \
      -O sample2_read-orientation-model.tar.gz

Command exit status:
  2

Command output:
  Tool returned:
  0
  Tool returned:
  SUCCESS

Command error:
  14:38:58.653 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.654 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.677 INFO  IOUtils - Extracting data from archive: file://sample2_0002-scattered.interval_list-f1r2.tar.gz
  14:38:58.709 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.710 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.710 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.728 INFO  IOUtils - Extracting data from archive: file://sample2_0038-scattered.interval_list-f1r2.tar.gz
  14:38:58.744 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.745 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.745 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.785 INFO  IOUtils - Extracting data from archive: file://sample2_0023-scattered.interval_list-f1r2.tar.gz
  14:38:58.800 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.801 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.802 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.820 INFO  IOUtils - Extracting data from archive: file://sample2_0011-scattered.interval_list-f1r2.tar.gz
  14:38:58.836 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.836 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.837 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.870 INFO  IOUtils - Extracting data from archive: file://sample2_0037-scattered.interval_list-f1r2.tar.gz
  14:38:58.886 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.887 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.887 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.910 INFO  IOUtils - Extracting data from archive: file://sample2_0030-scattered.interval_list-f1r2.tar.gz
  14:38:58.918 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.919 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.919 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.936 INFO  IOUtils - Extracting data from archive: file://sample2_0012-scattered.interval_list-f1r2.tar.gz
  14:38:58.942 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.943 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.943 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:58.981 INFO  IOUtils - Extracting data from archive: file://sample2_0032-scattered.interval_list-f1r2.tar.gz
  14:38:58.990 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:58.991 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:58.991 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:59.014 INFO  IOUtils - Extracting data from archive: file://sample2_0014-scattered.interval_list-f1r2.tar.gz
  14:38:59.020 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_histogram
  14:38:59.021 INFO  IOUtils - Extracting file: ./sample2_tumor.ref_histogram
  14:38:59.022 INFO  IOUtils - Extracting file: ./sample2_tumor.alt_table
  14:38:59.033 INFO  LearnReadOrientationModel - Shutting down engine
  [September 28, 2023 at 2:38:59 PM UTC] org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel done. Elapsed time: 0.01 minutes.
  Runtime.totalMemory()=285212672
  ***********************************************************************

  A USER ERROR has occurred: Could not extract data from: file://sample2_0014-scattered.interval_list-f1r2.tar.gz

  ***********************************************************************
  Set the system property GATK_STACKTRACE_ON_USER_EXCEPTION (--java-options '-DGATK_STACKTRACE_ON_USER_EXCEPTION=true') to print the stack trace.
  Using GATK jar /opt/gatk/gatk-package-4.4.0.0-local.jar
  Running:
      java -Dsamjdk.use_async_io_read_samtools=false -Dsamjdk.use_async_io_write_samtools=true -Dsamjdk.use_async_io_write_tribble=false -Dsamjdk.compression_level=2 -jar /opt/gatk/gatk-package-4.4.0.0-local.jar LearnReadOrientationModel --tmp-dir /scratch/temp/5859185 -I sample2_0001-scattered.interval_list-f1r2.tar.gz -I sample2_0033-scattered.interval_list-f1r2.tar.gz -I sample2_0016-scattered.interval_list-f1r2.tar.gz -I sample2_0000-scattered.interval_list-f1r2.tar.gz -I sample2_0031-scattered.interval_list-f1r2.tar.gz -I sample2_0002-scattered.interval_list-f1r2.tar.gz -I sample2_0038-scattered.interval_list-f1r2.tar.gz -I sample2_0023-scattered.interval_list-f1r2.tar.gz -I sample2_0011-scattered.interval_list-f1r2.tar.gz -I sample2_0037-scattered.interval_list-f1r2.tar.gz -I sample2_0030-scattered.interval_list-f1r2.tar.gz -I sample2_0012-scattered.interval_list-f1r2.tar.gz -I sample2_0032-scattered.interval_list-f1r2.tar.gz -I sample2_0014-scattered.interval_list-f1r2.tar.gz -I sample2_0007-scattered.interval_list-f1r2.tar.gz -I sample2_0015-scattered.interval_list-f1r2.tar.gz -I sample2_0028-scattered.interval_list-f1r2.tar.gz -I sample2_0005-scattered.interval_list-f1r2.tar.gz -I sample2_0008-scattered.interval_list-f1r2.tar.gz -I sample2_0027-scattered.interval_list-f1r2.tar.gz -I sample2_0004-scattered.interval_list-f1r2.tar.gz -I sample2_0021-scattered.interval_list-f1r2.tar.gz -I sample2_0003-scattered.interval_list-f1r2.tar.gz -I sample2_0035-scattered.interval_list-f1r2.tar.gz -I sample2_0018-scattered.interval_list-f1r2.tar.gz -I sample2_0020-scattered.interval_list-f1r2.tar.gz -I sample2_0017-scattered.interval_list-f1r2.tar.gz -I sample2_0019-scattered.interval_list-f1r2.tar.gz -I sample2_0039-scattered.interval_list-f1r2.tar.gz -I sample2_0013-scattered.interval_list-f1r2.tar.gz -I sample2_0029-scattered.interval_list-f1r2.tar.gz -I sample2_0024-scattered.interval_list-f1r2.tar.gz -I sample2_0036-scattered.interval_list-f1r2.tar.gz -I sample2_0006-scattered.interval_list-f1r2.tar.gz -I sample2_0025-scattered.interval_list-f1r2.tar.gz -I sample2_0026-scattered.interval_list-f1r2.tar.gz -I sample2_0010-scattered.interval_list-f1r2.tar.gz -I sample2_0009-scattered.interval_list-f1r2.tar.gz -I sample2_0022-scattered.interval_list-f1r2.tar.gz -I sample2_0034-scattered.interval_list-f1r2.tar.gz -O sample2_read-orientation-model.tar.gz

Work dir:
  /QRISdata/Q6373/results/results_1/work/32/4f5f39ab6af08ba9cc90eed3361958

Tip: you can replicate the issue by changing to the process work dir and entering the command `bash .command.run`
riederd commented 9 months ago

This is a GATK error. Just to be sure, can you try to manually reproduce it by:

cd /QRISdata/Q6373/results/results_1/work/32/4f5f39ab6af08ba9cc90eed3361958
bash .command.run
amoschoomy commented 9 months ago

Hi sorry for the late reply, I've actually rerun it but I ran into the same error but on a different work directory. And I ran what you said and run into ln: failed to create symbolic link 'sample1_0039-scattered.interval_list.vcf.gz': File exists

work.tar.gz

I've uploaded the work directory for your use

riederd commented 9 months ago

It seems that /QRISdata/Q6373/results/results_1/work/f3/1a75384eb71df85c5cac8ba99d4b6c/sample1_0015-scattered.interval_list-f1r2.tar.gz does not exist. Can you change into /QRISdata/Q6373/results/results_1/work/f3/1a75384eb71df85c5cac8ba99d4b6c/ and post the contents of the directory as tar.gz.?

amoschoomy commented 9 months ago

Hi, I've checked it and the file existed but nevertheless here are the directory contents work2.tar.gz

Thanks!

riederd commented 9 months ago

this is strange. May I ask you to try:

cd /QRISdata/Q6373/results/results_1/work/ae/e43079d0e13743a5bd94b0f958e0e7
singularity exec -B /QRISdata/Q6373/results/results_1/work -B /home/uqachoo1/nextNEOpi-1.4.0/bin --no-home --containall -H /scratch/temp/5890914 -B /home/uqachoo1/nextNEOpi-1.4.0/assets -B /scratch/temp/5890914 -B /QRISdata/Q6196/resources -B /home/uqachoo1/hlahd.1.7.0 -B /QRISdata/Q6196/resources/databases/iedb:/opt/iedb -B /QRISdata/Q6196/resources/databases/mhcflurry_data:/opt/mhcflurry_data /QRISdata/Q6373/results/results_1/work/singularity/apps-01.i-med.ac.at-images-singularity-nextNEOpi_1.4.0_f894e892.sif /bin/bash

and then

gatk LearnReadOrientationModel \
    --tmp-dir /scratch/temp/5890914 \
    -I sample1_0039-scattered.interval_list-f1r2.tar.gz -I sample1_0003-scattered.interval_list-f1r2.tar.gz -I sample1_0017-scattered.interval_list-f1r2.tar.gz -I sample1_0029-scattered.interval_list-f1r2.tar.gz -I sample1_0025-scattered.interval_list-f1r2.tar.gz -I sample1_0009-scattered.interval_list-f1r2.tar.gz -I sample1_0002-scattered.interval_list-f1r2.tar.gz -I sample1_0027-scattered.interval_list-f1r2.tar.gz -I sample1_0036-scattered.interval_list-f1r2.tar.gz -I sample1_0021-scattered.interval_list-f1r2.tar.gz -I sample1_0028-scattered.interval_list-f1r2.tar.gz -I sample1_0012-scattered.interval_list-f1r2.tar.gz -I sample1_0004-scattered.interval_list-f1r2.tar.gz -I sample1_0033-scattered.interval_list-f1r2.tar.gz -I sample1_0014-scattered.interval_list-f1r2.tar.gz -I sample1_0019-scattered.interval_list-f1r2.tar.gz -I sample1_0011-scattered.interval_list-f1r2.tar.gz -I sample1_0038-scattered.interval_list-f1r2.tar.gz -I sample1_0015-scattered.interval_list-f1r2.tar.gz -I sample1_0005-scattered.interval_list-f1r2.tar.gz -I sample1_0018-scattered.interval_list-f1r2.tar.gz -I sample1_0026-scattered.interval_list-f1r2.tar.gz -I sample1_0035-scattered.interval_list-f1r2.tar.gz -I sample1_0030-scattered.interval_list-f1r2.tar.gz -I sample1_0016-scattered.interval_list-f1r2.tar.gz -I sample1_0001-scattered.interval_list-f1r2.tar.gz -I sample1_0013-scattered.interval_list-f1r2.tar.gz -I sample1_0008-scattered.interval_list-f1r2.tar.gz -I sample1_0022-scattered.interval_list-f1r2.tar.gz -I sample1_0024-scattered.interval_list-f1r2.tar.gz -I sample1_0034-scattered.interval_list-f1r2.tar.gz -I sample1_0006-scattered.interval_list-f1r2.tar.gz -I sample1_0020-scattered.interval_list-f1r2.tar.gz -I sample1_0010-scattered.interval_list-f1r2.tar.gz -I sample1_0031-scattered.interval_list-f1r2.tar.gz -I sample1_0037-scattered.interval_list-f1r2.tar.gz -I sample1_0000-scattered.interval_list-f1r2.tar.gz -I sample1_0007-scattered.interval_list-f1r2.tar.gz -I sample1_0032-scattered.interval_list-f1r2.tar.gz -I sample1_0023-scattered.interval_list-f1r2.tar.gz \
    -O sample1_read-orientation-model.tar.gz

and

gatk LearnReadOrientationModel \
    --tmp-dir /scratch/temp/5890914 \
    -I  sample1_0014-scattered.interval_list-f1r2.tar.gz \
    -O int_0014_read-orientation-model.tar.gz
zktuong commented 9 months ago

hi, i've been experiencing this as well. I ran it manually as you suggested and I chased it down to --containall was stuffing it up in the singularity exec step. removing it manually made it work.

i'm not well versed with nextflow and nextNEOpi so could you tell me where should i disable this??

zktuong commented 9 months ago

ok sorry now i'm getting a better sense of what's happening.

for clarity, instead of the original code: https://github.com/icbi-lab/nextNEOpi/blob/fe7b21cdc0b97aae38195e5f5ac1b9851674f6b1/conf/params.config#L277-L281

The copy of my conf/params.config was the following as i had previous seen this #25

singularity {
    enabled = true
    autoMounts = true
    runOptions =  "--no-home --containall" + " -H " + params.singularityTmpMount + " -B " +  params.singularityAssetsMount + " -B " + params.singularityTmpMount + " -B " + params.resourcesBaseDir + params.singularityHLAHDmount + " -B " + params.databases.IEDB_dir + ":/opt/iedb" + " -B " + params.databases.MHCFLURRY_dir + ":/opt/mhcflurry_data"
}

so it seems that having --containall could make the NeoFuse works but not the gatherMutect2VCFs step.

Is there a more permanent solution? seems a bit tedious to have to run the pipeline twice (without and with --containall) as the workaround?

riederd commented 9 months ago

Hi, the --containall run option was removed from the default in nextNEOpi see https://github.com/icbi-lab/nextNEOpi/commit/ebfb100f13f0d00070885fca0337072095f29866

If you need it for NeoFuse to work, you may put it into:

https://github.com/icbi-lab/nextNEOpi/blob/71046cf9ca4af2560879fd693a24a15f991ae556/conf/process.config#L169-L172

e.g.:

     withName:Neofuse {
        container = 'https://apps-01.i-med.ac.at/images/singularity/NeoFuse_dev_92712e07.sif'
        cpus = 10
        containerOptions = "--containall"
    }

Let me know if this works for you.

amoschoomy commented 9 months ago

Hi @riederd I've tried executing it manually and it worked (I had to specify a different $TMPDIR on that manual command as i do not have access to the temp directory at runtime).

I then reexecuted the pipeline with the full stack trace of GATK enabled and now I'm getting the same issue. Apparently I have no space left on my device? Which is weird because I've allocated 5TB on my shared project space and $TMPDIR is setup and allocated automatically by the HPC system at the submission of jobs (which would be sufficient for most of the heavy jobs?) I'm not sure how much more do GATK needs in regards to this or is there a way to reduce the space usage?

Command error:
  17:39:44.852 INFO  IOUtils - Extracting file: ./sample1_tumor.alt_table
  17:39:44.857 INFO  IOUtils - Extracting data from archive: file:///scratch/project_mnt/S0142/results/results_1/work/84/389049f48f5f9b757eb2fcc716e0fa/sample1_0028-scattered.interval_list-f1r2.tar.gz
  17:39:44.858 INFO  IOUtils - Extracting file: ./sample1_tumor.alt_histogram
  17:39:44.859 INFO  IOUtils - Extracting file: ./sample1_tumor.ref_histogram
  17:39:44.859 INFO  IOUtils - Extracting file: ./sample1_tumor.alt_table
  17:39:44.865 INFO  IOUtils - Extracting data from archive: file:///scratch/project_mnt/S0142/results/results_1/work/84/389049f48f5f9b757eb2fcc716e0fa/sample1_0032-scattered.interval_list-f1r2.tar.gz
  17:39:44.867 INFO  IOUtils - Extracting file: ./sample1_tumor.alt_histogram
  17:39:44.867 INFO  IOUtils - Extracting file: ./sample1_tumor.ref_histogram
  17:39:44.868 INFO  IOUtils - Extracting file: ./sample1_tumor.alt_table
  17:39:44.869 INFO  LearnReadOrientationModel - Shutting down engine
  [October 16, 2023 at 5:39:44 PM UTC] org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel done. Elapsed time: 0.00 minutes.
  Runtime.totalMemory()=335544320
  ***********************************************************************

  A USER ERROR has occurred: Could not extract data from: file:///scratch/project_mnt/S0142/results/results_1/work/84/389049f48f5f9b757eb2fcc716e0fa/sample1_0032-scattered.interval_list-f1r2.tar.gz

  ***********************************************************************
  org.broadinstitute.hellbender.exceptions.UserException: Could not extract data from: file:///scratch/project_mnt/S0142/results/results_1/work/84/389049f48f5f9b757eb2fcc716e0fa/sample1_0032-scattered.interval_list-f1r2.tar.gz
    at org.broadinstitute.hellbender.utils.io.IOUtils.extractTarGz(IOUtils.java:458)
    at org.broadinstitute.hellbender.utils.io.IOUtils.extractTarGz(IOUtils.java:434)
    at org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel.lambda$doWork$1(LearnReadOrientationModel.java:86)
    at java.base/java.util.stream.Streams$RangeIntSpliterator.forEachRemaining(Streams.java:104)
    at java.base/java.util.stream.IntPipeline$Head.forEach(IntPipeline.java:617)
    at org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel.doWork(LearnReadOrientationModel.java:86)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:149)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:198)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:217)
    at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:160)
    at org.broadinstitute.hellbender.Main.mainEntry(Main.java:203)
    at org.broadinstitute.hellbender.Main.main(Main.java:289)
  Caused by: java.io.IOException: No space left on device
    at java.base/sun.nio.ch.FileDispatcherImpl.write0(Native Method)
    at java.base/sun.nio.ch.FileDispatcherImpl.write(FileDispatcherImpl.java:62)
    at java.base/sun.nio.ch.IOUtil.writeFromNativeBuffer(IOUtil.java:132)
    at java.base/sun.nio.ch.IOUtil.write(IOUtil.java:97)
    at java.base/sun.nio.ch.IOUtil.write(IOUtil.java:67)
    at java.base/sun.nio.ch.FileChannelImpl.write(FileChannelImpl.java:288)
    at java.base/java.nio.channels.Channels.writeFullyImpl(Channels.java:74)
    at java.base/java.nio.channels.Channels.writeFully(Channels.java:96)
    at java.base/java.nio.channels.Channels$1.write(Channels.java:171)
    at org.apache.commons.io.IOUtils.copyLarge(IOUtils.java:1310)
    at org.apache.commons.io.IOUtils.copy(IOUtils.java:978)
    at org.apache.commons.io.IOUtils.copyLarge(IOUtils.java:1282)
    at org.apache.commons.io.IOUtils.copy(IOUtils.java:953)
    at org.broadinstitute.hellbender.utils.io.IOUtils.extractFilesFromArchiveStream(IOUtils.java:523)
    at org.broadinstitute.hellbender.utils.io.IOUtils.extractTarGz(IOUtils.java:455)
    ... 11 more
  Using GATK jar /opt/gatk/gatk-package-4.4.0.0-local.jar
  Running:
      java -Dsamjdk.use_async_io_read_samtools=false -Dsamjdk.use_async_io_write_samtools=true -Dsamjdk.use_async_io_write_tribble=false -Dsamjdk.compression_level=2 -Xmx50G -XX:ParallelGCThreads=60 -DGATK_STACKTRACE_ON_USER_EXCEPTION=true -jar /opt/gatk/gatk-package-4.4.0.0-local.jar LearnReadOrientationModel --tmp-dir /scratch/temp/5976135 -I sample1_0034-scattered.interval_list-f1r2.tar.gz -I sample1_0025-scattered.interval_list-f1r2.tar.gz -I sample1_0036-scattered.interval_list-f1r2.tar.gz -I sample1_0001-scattered.interval_list-f1r2.tar.gz -I sample1_0038-scattered.interval_list-f1r2.tar.gz -I sample1_0020-scattered.interval_list-f1r2.tar.gz -I sample1_0023-scattered.interval_list-f1r2.tar.gz -I sample1_0016-scattered.interval_list-f1r2.tar.gz -I sample1_0010-scattered.interval_list-f1r2.tar.gz -I sample1_0022-scattered.interval_list-f1r2.tar.gz -I sample1_0019-scattered.interval_list-f1r2.tar.gz -I sample1_0004-scattered.interval_list-f1r2.tar.gz -I sample1_0028-scattered.interval_list-f1r2.tar.gz -I sample1_0032-scattered.interval_list-f1r2.tar.gz -I sample1_0018-scattered.interval_list-f1r2.tar.gz -I sample1_0011-scattered.interval_list-f1r2.tar.gz -I sample1_0029-scattered.interval_list-f1r2.tar.gz -I sample1_0008-scattered.interval_list-f1r2.tar.gz -I sample1_0015-scattered.interval_list-f1r2.tar.gz -I sample1_0031-scattered.interval_list-f1r2.tar.gz -I sample1_0024-scattered.interval_list-f1r2.tar.gz -I sample1_0035-scattered.interval_list-f1r2.tar.gz -I sample1_0003-scattered.interval_list-f1r2.tar.gz -I sample1_0030-scattered.interval_list-f1r2.tar.gz -I sample1_0000-scattered.interval_list-f1r2.tar.gz -I sample1_0039-scattered.interval_list-f1r2.tar.gz -I sample1_0006-scattered.interval_list-f1r2.tar.gz -I sample1_0033-scattered.interval_list-f1r2.tar.gz -I sample1_0021-scattered.interval_list-f1r2.tar.gz -I sample1_0012-scattered.interval_list-f1r2.tar.gz -I sample1_0007-scattered.interval_list-f1r2.tar.gz -I sample1_0014-scattered.interval_list-f1r2.tar.gz -I sample1_0037-scattered.interval_list-f1r2.tar.gz -I sample1_0026-scattered.interval_list-f1r2.tar.gz -I sample1_0009-scattered.interval_list-f1r2.tar.gz -I sample1_0005-scattered.interval_list-f1r2.tar.gz -I sample1_0013-scattered.interval_list-f1r2.tar.gz -I sample1_0027-scattered.interval_list-f1r2.tar.gz -I sample1_0017-scattered.interval_list-f1r2.tar.gz -I sample1_0002-scattered.interval_list-f1r2.tar.gz -O sample1_read-orientation-model.tar.gz
zktuong commented 9 months ago

Hi, the --containall run option was removed from the default in nextNEOpi see ebfb100

If you need it for NeoFuse to work, you may put it into:

https://github.com/icbi-lab/nextNEOpi/blob/71046cf9ca4af2560879fd693a24a15f991ae556/conf/process.config#L169-L172

e.g.:

     withName:Neofuse {
        container = 'https://apps-01.i-med.ac.at/images/singularity/NeoFuse_dev_92712e07.sif'
        cpus = 10
        containerOptions = "--containall"
    }

Let me know if this works for you.

thanks! yes this works for me now. Both gatherMutect2VCFs and Neofuse are working now in a single run with the following modifications:

since params.config has includeConfig './process.config' above, so i modified the last parts of params.config to:

singularity {
    enabled = true
    autoMounts = true
    runOptions =  process.containerOptions + " -H " + params.singularityTmpMount + " -B " +  params.singularityAssetsMount + " -B " + params.singularityTmpMount + " -B " + params.resourcesBaseDir + params.singularityHLAHDmount + " -B " + params.databases.IEDB_dir + ":/opt/iedb" + " -B " + params.databases.MHCFLURRY_dir + ":/opt/mhcflurry_data"
}

and process.config to add:

    // default singularity run options
    containerOptions = "--no-home"

    withName:Neofuse {
        container = 'https://apps-01.i-med.ac.at/images/singularity/NeoFuse_dev_92712e07.sif'
        cpus = 60
        containerOptions = "--no-home --containall"
    }

Just need to see if it runs through the rest ok.

riederd commented 8 months ago

I'll implement these changes, and close the issue. Feel free to reopen if it needs more discussion