nf-core / sarek

Analysis pipeline to detect germline or somatic variants (pre-processing, variant calling and annotation) from WGS / targeted sequencing
https://nf-co.re/sarek
MIT License
408 stars 415 forks source link

use custom snpeff database, such as Candian.aruis #1654

Open ybdong919 opened 2 months ago

ybdong919 commented 2 months ago

Description of the bug

How can I use a custom snpeff reference database, such as candida.auris, in Sarek. This snpeff reference did not included in Sarek. I tried --snpeff_cache, but not work. Can you show me how to do it?

Command used and terminal output

No response

Relevant files

No response

System information

No response

ybdong919 commented 2 months ago

Command used: nextflow run ./sarek -profile singularity --input samplesheet.csv --outdir ./Candida --tools 'freebayes,snpeff' --genome null --igenomes_ignore --fasta ./ref/hs37d5.fa --skip_tools baserecalibrator --snpeff_cache .....//candida_auris.5

Terminal output: N E X T F L O W ~ version 24.04.4

Launching ./sarek/main.nf [trusting_dijkstra] DSL2 - revision: e3d6110e17

WARN: Access to undefined parameter monochromeLogs -- Initialise it to a default value eg. params.monochromeLogs = some_value


                                    ,--./,-.
    ___     __   __   __   ___     /,-._.--~'

|\ | | / / \ |__) |__ } { | \| | \__, \__/ | \ |___ \-.,--, .,._,'


.´ _  `.

/ |`-_ \ _
| | \ -| |__ /\ |) | |/ \ | \ / .| /¯¯\ | \ |_
| \ `|____\´

nf-core/sarek v3.4.4

Core Nextflow options runName : trusting_dijkstra containerEngine : singularity launchDir : /blue/bphl-florida/dongyibo/nf-core/sarek workDir : /blue/bphl-florida/dongyibo/nf-core/sarek/work projectDir : /blue/bphl-florida/dongyibo/nf-core/sarek/sarek userName : dongyibo profile : singularity configFiles :

Input/output options input : samplesheet.csv outdir : ./Candida

Main options tools : freebayes,snpeff skip_tools : baserecalibrator .... .... .... -- Check '.nextflow.log' file for details

executor > local (83) [c9/a8d30f] NFC…:BWAMEM1_INDEX (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [d3/9b5dfb] NFC…ENCEDICTIONARY (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [bb/9ae97e] NFC…SAMTOOLS_FAIDX (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [ad/2b40fb] NFC…S:BUILD_INTERVALS (hs37d5) | 1 of 1 ✔ [50/b5d9e2] NFC…INTERVALS_BED (hs37d5.bed) | 1 of 1 ✔ [54/624f89] NFC…RVAL_SPLIT (4_1-191154276) | 18 of 18 ✔ [ff/c3922b] NFC…INTERVAL_COMBINED (hs37d5) | 1 of 1 ✔ [- ] NFC…RING_DECOMPRESS_TO_FQ_PAIR - [- ] NFC…SPRING_DECOMPRESS_TO_R1_FQ - [- ] NFC…SPRING_DECOMPRESS_TO_R2_FQ - [- ] NFC…NPUT:SAMTOOLS_VIEW_MAP_MAP - [6e/0e407a] NFC…REK:SAREK:FASTQC (S1-L002) | 1 of 1 ✔ [7e/5b706c] NFC…AREK:SAREK:FASTP (S1-L002) | 1 of 1 ✔ [a4/ad2737] NFC…_SENTIEON:BWAMEM1_MEM (S1) | 12 of 12 ✔ [b4/903f14] NFC…:GATK4_MARKDUPLICATES (S1) | 1 of 1 ✔ [fe/613a18] NFC…MTOOLS:SAMTOOLS_STATS (S1) | 1 of 1 ✔ [67/641dc4] NFC…PTH_SAMTOOLS:MOSDEPTH (S1) | 1 of 1 ✔ [f8/cd6441] NFC…G_FREEBAYES:FREEBAYES (S1) | 18 of 18 ✔ [7a/9bb374] NFC…EEBAYES:BCFTOOLS_SORT (S1) | 18 of 18 ✔ [4f/fc66d0] NFC…BAYES:MERGE_FREEBAYES (S1) | 1 of 1 ✔ [42/fec85a] NFC…FTOOLS:BCFTOOLS_STATS (S1) | 1 of 1 ✔ [dd/de5de9] NFC…S:VCFTOOLS_TSTV_COUNT (S1) | 1 of 1 ✔ [b6/af9e8f] NFC…LS:VCFTOOLS_TSTV_QUAL (S1) | 1 of 1 ✔ [81/ec6c50] NFC…OOLS:VCFTOOLS_SUMMARY (S1) | 1 of 1 ✔ [f1/aa0a68] NFC…_SNPEFF:SNPEFF_SNPEFF (S1) | 1 of 1, failed: 1 ✘ Plus 24 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] Pulling Singularity image https://depot.galaxyproject.org/singularity/vcftools:0.1.16--he513fc3_4 [cache /blue/bphl-florida/dongyibo/nf-core/sarek/work/singularity/depot.galaxyproject.org-singularity-vcftools-0.1.16--he513fc3_4.img] Execution cancelled -- Finishing pending tasks before exit ERROR ~ Error executing process > 'NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (S1)'

Caused by: Process NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (S1) terminated with an error exit status (255)

Command executed:

snpEff \ -Xmx29491M \ null.null \ -nodownload -canon -v \ -csvStats S1.freebayes_snpEff.csv \ -dataDir ${PWD}/candida_auris.5 \ S1.freebayes.vcf.gz \

S1.freebayes_snpEff.ann.vcf

cat <<-END_VERSIONS > versions.yml "NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF": snpeff: $(echo $(snpEff -version 2>&1) | cut -f 2 -d ' ') END_VERSIONS

Command exit status: 255

Command output: (empty)

Command error: INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred 00:00:00 SnpEff version SnpEff 5.1d (build 2022-04-19 15:49), by Pablo Cingolani 00:00:00 Command: 'ann' 00:00:00 Reading configuration file 'snpEff.config'. Genome: 'null.null' 00:00:00 Reading config file: snpEff.config 00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config java.lang.RuntimeException: Property: 'null.null.genome' not found at org.snpeff.interval.Genome.(Genome.java:104) at org.snpeff.snpEffect.Config.readGenomeConfig(Config.java:784) at org.snpeff.snpEffect.Config.readConfig(Config.java:751) at org.snpeff.snpEffect.Config.init(Config.java:529) at org.snpeff.snpEffect.Config.(Config.java:116) at org.snpeff.SnpEff.loadConfig(SnpEff.java:429) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:889) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:875) at org.snpeff.SnpEff.run(SnpEff.java:1141) at org.snpeff.SnpEff.main(SnpEff.java:160) 00:00:00 Logging 00:00:01 Done.

Work dir: /blue/bphl-florida/dongyibo/nf-core/sarek/work/f1/aa0a689fee1dc19e95facea7a517d5

Tip: view the complete command output by changing to the process work dir and entering the command cat .command.out

-- Check '.nextflow.log' file for details ERROR ~ Pipeline failed. Please refer to troubleshooting docs: https://nf-co.re/docs/usage/troubleshooting

-- Check '.nextflow.log' file for details

executor > local (83) [c9/a8d30f] NFC…:BWAMEM1_INDEX (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [d3/9b5dfb] NFC…ENCEDICTIONARY (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [bb/9ae97e] NFC…SAMTOOLS_FAIDX (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [ad/2b40fb] NFC…S:BUILD_INTERVALS (hs37d5) | 1 of 1 ✔ [50/b5d9e2] NFC…INTERVALS_BED (hs37d5.bed) | 1 of 1 ✔ [54/624f89] NFC…RVAL_SPLIT (4_1-191154276) | 18 of 18 ✔ [ff/c3922b] NFC…INTERVAL_COMBINED (hs37d5) | 1 of 1 ✔ [- ] NFC…RING_DECOMPRESS_TO_FQ_PAIR - [- ] NFC…SPRING_DECOMPRESS_TO_R1_FQ - [- ] NFC…SPRING_DECOMPRESS_TO_R2_FQ - [- ] NFC…NPUT:SAMTOOLS_VIEW_MAP_MAP - [6e/0e407a] NFC…REK:SAREK:FASTQC (S1-L002) | 1 of 1 ✔ [7e/5b706c] NFC…AREK:SAREK:FASTP (S1-L002) | 1 of 1 ✔ [a4/ad2737] NFC…_SENTIEON:BWAMEM1_MEM (S1) | 12 of 12 ✔ [b4/903f14] NFC…:GATK4_MARKDUPLICATES (S1) | 1 of 1 ✔ [fe/613a18] NFC…MTOOLS:SAMTOOLS_STATS (S1) | 1 of 1 ✔ [67/641dc4] NFC…PTH_SAMTOOLS:MOSDEPTH (S1) | 1 of 1 ✔ [f8/cd6441] NFC…G_FREEBAYES:FREEBAYES (S1) | 18 of 18 ✔ [7a/9bb374] NFC…EEBAYES:BCFTOOLS_SORT (S1) | 18 of 18 ✔ [4f/fc66d0] NFC…BAYES:MERGE_FREEBAYES (S1) | 1 of 1 ✔ [42/fec85a] NFC…FTOOLS:BCFTOOLS_STATS (S1) | 1 of 1 ✔ [dd/de5de9] NFC…S:VCFTOOLS_TSTV_COUNT (S1) | 1 of 1 ✔ [b6/af9e8f] NFC…LS:VCFTOOLS_TSTV_QUAL (S1) | 1 of 1 ✔ [81/ec6c50] NFC…OOLS:VCFTOOLS_SUMMARY (S1) | 1 of 1 ✔ [f1/aa0a68] NFC…_SNPEFF:SNPEFF_SNPEFF (S1) | 1 of 1, failed: 1 ✘ Plus 24 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] Pulling Singularity image https://depot.galaxyproject.org/singularity/vcftools:0.1.16--he513fc3_4 [cache /blue/bphl-florida/dongyibo/nf-core/sarek/work/singularity/depot.galaxyproject.org-singularity-vcftools-0.1.16--he513fc3_4.img] Execution cancelled -- Finishing pending tasks before exit -[nf-core/sarek] Pipeline completed with errors- ERROR ~ Error executing process > 'NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (S1)'

Caused by: Process NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (S1) terminated with an error exit status (255)

Command executed:

snpEff \ -Xmx29491M \ null.null \ -nodownload -canon -v \ -csvStats S1.freebayes_snpEff.csv \ -dataDir ${PWD}/candida_auris.5 \ S1.freebayes.vcf.gz \

S1.freebayes_snpEff.ann.vcf

cat <<-END_VERSIONS > versions.yml "NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF": snpeff: $(echo $(snpEff -version 2>&1) | cut -f 2 -d ' ') END_VERSIONS

Command exit status: 255

Command output: (empty)

Command error: INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred 00:00:00 SnpEff version SnpEff 5.1d (build 2022-04-19 15:49), by Pablo Cingolani 00:00:00 Command: 'ann' 00:00:00 Reading configuration file 'snpEff.config'. Genome: 'null.null' 00:00:00 Reading config file: snpEff.config 00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config java.lang.RuntimeException: Property: 'null.null.genome' not found at org.snpeff.interval.Genome.(Genome.java:104) at org.snpeff.snpEffect.Config.readGenomeConfig(Config.java:784) at org.snpeff.snpEffect.Config.readConfig(Config.java:751) at org.snpeff.snpEffect.Config.init(Config.java:529) at org.snpeff.snpEffect.Config.(Config.java:116) at org.snpeff.SnpEff.loadConfig(SnpEff.java:429) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:889) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:875) at org.snpeff.SnpEff.run(SnpEff.java:1141) at org.snpeff.SnpEff.main(SnpEff.java:160) 00:00:00 Logging 00:00:01 Done.

Work dir: /blue/bphl-florida/dongyibo/nf-core/sarek/work/f1/aa0a689fee1dc19e95facea7a517d5

Tip: view the complete command output by changing to the process work dir and entering the command cat .command.out

-- Check '.nextflow.log' file for details ERROR ~ Pipeline failed. Please refer to troubleshooting docs: https://nf-co.re/docs/usage/troubleshooting

-- Check '.nextflow.log' file for details

executor > local (83) [c9/a8d30f] NFC…:BWAMEM1_INDEX (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [d3/9b5dfb] NFC…ENCEDICTIONARY (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [bb/9ae97e] NFC…SAMTOOLS_FAIDX (hs37d5.fa) | 1 of 1 ✔ [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [ad/2b40fb] NFC…S:BUILD_INTERVALS (hs37d5) | 1 of 1 ✔ [50/b5d9e2] NFC…INTERVALS_BED (hs37d5.bed) | 1 of 1 ✔ [54/624f89] NFC…RVAL_SPLIT (4_1-191154276) | 18 of 18 ✔ [ff/c3922b] NFC…INTERVAL_COMBINED (hs37d5) | 1 of 1 ✔ [- ] NFC…RING_DECOMPRESS_TO_FQ_PAIR - [- ] NFC…SPRING_DECOMPRESS_TO_R1_FQ - [- ] NFC…SPRING_DECOMPRESS_TO_R2_FQ - [- ] NFC…NPUT:SAMTOOLS_VIEW_MAP_MAP - [6e/0e407a] NFC…REK:SAREK:FASTQC (S1-L002) | 1 of 1 ✔ [7e/5b706c] NFC…AREK:SAREK:FASTP (S1-L002) | 1 of 1 ✔ [a4/ad2737] NFC…_SENTIEON:BWAMEM1_MEM (S1) | 12 of 12 ✔ [b4/903f14] NFC…:GATK4_MARKDUPLICATES (S1) | 1 of 1 ✔ [fe/613a18] NFC…MTOOLS:SAMTOOLS_STATS (S1) | 1 of 1 ✔ [67/641dc4] NFC…PTH_SAMTOOLS:MOSDEPTH (S1) | 1 of 1 ✔ [f8/cd6441] NFC…G_FREEBAYES:FREEBAYES (S1) | 18 of 18 ✔ [7a/9bb374] NFC…EEBAYES:BCFTOOLS_SORT (S1) | 18 of 18 ✔ [4f/fc66d0] NFC…BAYES:MERGE_FREEBAYES (S1) | 1 of 1 ✔ [42/fec85a] NFC…FTOOLS:BCFTOOLS_STATS (S1) | 1 of 1 ✔ [dd/de5de9] NFC…S:VCFTOOLS_TSTV_COUNT (S1) | 1 of 1 ✔ [b6/af9e8f] NFC…LS:VCFTOOLS_TSTV_QUAL (S1) | 1 of 1 ✔ [81/ec6c50] NFC…OOLS:VCFTOOLS_SUMMARY (S1) | 1 of 1 ✔ [f1/aa0a68] NFC…_SNPEFF:SNPEFF_SNPEFF (S1) | 1 of 1, failed: 1 ✘ Plus 24 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] Pulling Singularity image https://depot.galaxyproject.org/singularity/vcftools:0.1.16--he513fc3_4 [cache /blue/bphl-florida/dongyibo/nf-core/sarek/work/singularity/depot.galaxyproject.org-singularity-vcftools-0.1.16--he513fc3_4.img] Execution cancelled -- Finishing pending tasks before exit -[nf-core/sarek] Pipeline completed with errors- ERROR ~ Error executing process > 'NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (S1)'

Caused by: Process NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (S1) terminated with an error exit status (255)

Command executed:

snpEff \ -Xmx29491M \ null.null \ -nodownload -canon -v \ -csvStats S1.freebayes_snpEff.csv \ -dataDir ${PWD}/candida_auris.5 \ S1.freebayes.vcf.gz \

S1.freebayes_snpEff.ann.vcf

cat <<-END_VERSIONS > versions.yml "NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF": snpeff: $(echo $(snpEff -version 2>&1) | cut -f 2 -d ' ') END_VERSIONS

Command exit status: 255

Command output: (empty)

Command error: INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred 00:00:00 SnpEff version SnpEff 5.1d (build 2022-04-19 15:49), by Pablo Cingolani 00:00:00 Command: 'ann' 00:00:00 Reading configuration file 'snpEff.config'. Genome: 'null.null' 00:00:00 Reading config file: snpEff.config 00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config java.lang.RuntimeException: Property: 'null.null.genome' not found at org.snpeff.interval.Genome.(Genome.java:104) at org.snpeff.snpEffect.Config.readGenomeConfig(Config.java:784) at org.snpeff.snpEffect.Config.readConfig(Config.java:751) at org.snpeff.snpEffect.Config.init(Config.java:529) at org.snpeff.snpEffect.Config.(Config.java:116) at org.snpeff.SnpEff.loadConfig(SnpEff.java:429) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:889) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:875) at org.snpeff.SnpEff.run(SnpEff.java:1141) at org.snpeff.SnpEff.main(SnpEff.java:160) 00:00:00 Logging 00:00:01 Done.

Work dir: /blue/bphl-florida/dongyibo/nf-core/sarek/work/f1/aa0a689fee1dc19e95facea7a517d5

Tip: view the complete command output by changing to the process work dir and entering the command cat .command.out

-- Check '.nextflow.log' file for details ERROR ~ Pipeline failed. Please refer to troubleshooting docs: https://nf-co.re/docs/usage/troubleshooting

-- Check '.nextflow.log' file for details

asp8200 commented 2 months ago

I think that the problem is that the "genome" for snpeff is set to null.null:

00:00:00 Reading configuration file 'snpEff.config'. Genome: 'null.null'
00:00:00 Reading config file: snpEff.config
00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config
java.lang.RuntimeException: Property: 'null.null.genome' not found

AFAIKT, it means that the value of $db here is null.null, and I traced that back to this line.

Looking at the snpeff-cache that you downloaded, you might be able to figure out what appropriate values for snpeff_genome and snpeff_db should be. (In cache of GRCh38, I find that my snpeff-cache contains a subfolder called GRCh38.105 and so we got snpeff_genome=GRCh38 and snpeff_db=105.) Once you got those values figured out, try using those value for setting --snpeff_genome and --snpeff_db directly in the nextflow command.

ybdong919 commented 2 months ago

According to snpEff.config, Candida auris database is below:

Screenshot 2024-09-17 003833 Screenshot 2024-09-17 003833

How do I set snpeff_genome and snpeff_db?

maxulysse commented 1 month ago

Hi @ybdong919, Usually what I do to get the snpeff info is:

docker run quay.io/biocontainers/snpeff:5.1--hdfd78af_2 snpEff databases | grep Homo_sapiens
GRCh37.75                                                       Homo_sapiens                                                                                                [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_GRCh37.75.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_GRCh37.75.zip]
GRCh38.105                                                      Homo_sapiens                                                                                                [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_GRCh38.105.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_GRCh38.105.zip]
GRCh38.99                                                       Homo_sapiens                                                                                                [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_GRCh38.99.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_GRCh38.99.zip]
hg19                                                            Homo_sapiens (UCSC)                                                                                         [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_hg19.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_hg19.zip]
hg19kg                                                          Homo_sapiens (UCSC KnownGenes)                                                                              [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_hg19kg.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_hg19kg.zip]
hg38                                                            Homo_sapiens (UCSC)                                                                                         [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_hg38.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_hg38.zip]
hg38kg                                                          Homo_sapiens (UCSC KnownGenes)                                                                              [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_hg38kg.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_hg38kg.zip]
testHg19ChrM                                                    Homo_sapiens (UCSC)                                                                                         [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_testHg19ChrM.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_testHg19ChrM.zip]

The first column is the info that I used to make the snpeff_genome.snpeff_db.

In some cases there's no snpeff_db, and I don't think sarek handles that for now.

I'll try to have a more in depth look and see how to deal with that within sarek.

asp8200 commented 1 month ago

Thanks, @maxulysse. With the command docker run quay.io/biocontainers/snpeff:5.1--hdfd78af_2 snpEff databases | grep candida_auris, I got

_candida_auris_gca_001189475                                    _candida_auris_gca_001189475
_candida_auris_gca_002759435                                    _candida_auris_gca_002759435
_candida_auris_gca_002775015                                    _candida_auris_gca_002775015
_candida_auris_gca_003013715                                    _candida_auris_gca_003013715
_candida_auris_gca_003014415                                    _candida_auris_gca_003014415
_candida_auris_gca_007168705                                    _candida_auris_gca_007168705
_candida_auris_gca_008275145                                    _candida_auris_gca_008275145

So perhaps it would work if @ybdong919 set --snpeff_genome _candida_auris_gca and --snpeff_db 001189475 in his nextflow command, and fix this line accordingly, that is, change it to

params.snpeff_genome ? "${params.snpeff_genome}_${params.snpeff_db}" : "${params.genome}.${params.snpeff_db}",

(Notice a dot (.) was changed to an underscore (_).)

Or something like that?

ybdong919 commented 1 month ago

@asp8200 I made the change from . to in the line you mentioned. However, it does not work. According to the log, it looks like more places that need to be changed from ( . ) to ( ).

The command is : nextflow run ./sarek -profile singularity --input samplesheet.csv --outdir ./Candida --tools 'snpeff' --step annotate --genome null --igenomes_ignore --skip_tools baserecalibrator --snpeff_cache //nf-core/sarek/data --snpeff_genome _candida_auris --snpeff_db gca_001189475

The termianl output is: N E X T F L O W ~ version 24.04.4

Launching ./sarek/main.nf [thirsty_gautier] DSL2 - revision: e3d6110e17

WARN: Access to undefined parameter monochromeLogs -- Initialise it to a default value eg. params.monochromeLogs = some_value


                                    ,--./,-.
    ___     __   __   __   ___     /,-._.--~'

|\ | | / / \ |__) |__ } { | \| | \__, \__/ | \ |___ \-.,--, .,._,'


.´ _  `.

/ |`-_ \ _
| | \ -| |__ /\ |) | |/ \ | \ / .| /¯¯\ | \ |_
| \ `|____\´

nf-core/sarek v3.4.4

Core Nextflow options runName : thirsty_gautier containerEngine : singularity launchDir : /blue/bphl-florida/dongyibo/nf-core/sarek workDir : /blue/bphl-florida/dongyibo/nf-core/sarek/work projectDir : /blue/bphl-florida/dongyibo/nf-core/sarek/sarek userName : dongyibo profile : singularity configFiles :

Input/output options input : samplesheet.csv step : annotate outdir : ./Candida

Main options tools : snpeff skip_tools : baserecalibrator

Reference genome options genome : null snpeff_db : gca_001189475 snpeff_genome : _candida_auris igenomes_ignore : true snpeff_cache : /blue/bphl-florida/dongyibo/nf-core/sarek/data

Generic options validationLenientMode: true

!! Only displaying parameters that differ from the pipeline defaults !!

If you use nf-core/sarek for your analysis please cite:

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - Path provided with SnpEff cache is invalid. Make sure there is a directory named _candida_auris.gca_001189475 in /blue/bphl-florida/dongyibo/nf-core/sarek/data./nPlease refer to https://nf-co.re/sarek/docs/usage/#how-to-customise-snpeff-and-vep-annotation for more information.

asp8200 commented 1 month ago

It looks to me like you haven't made the change I mentioned above. The pipeline is looking for a directory named _candida_auris.gca_001189475 which is what I would expect it to do, if you haven't changed that line 877.

ybdong919 commented 1 month ago

I did change. See my line 877 below:

        VCF_ANNOTATE_ALL(
            vcf_to_annotate.map{meta, vcf -> [ meta + [ file_name: vcf.baseName ], vcf ] },
            vep_fasta,
            params.tools,
            //params.snpeff_genome ? "${params.snpeff_genome}.${params.snpeff_db}" : "${params.genome}.${params.snpeff_db}",
            **params.snpeff_genome ? "${params.snpeff_genome}_${params.snpeff_db}" : "${params.genome}.${params.snpeff_db}",**
            snpeff_cache,
            vep_genome,
            vep_species,
            vep_cache_version,
            vep_cache,
            vep_extra_files,
            bcftools_annotations,
            bcftools_annotations_tbi,
            bcftools_header_lines)

        // Gather used softwares versions
        versions = versions.mix(VCF_ANNOTATE_ALL.out.versions)
        reports = reports.mix(VCF_ANNOTATE_ALL.out.reports)
    }
}
ybdong919 commented 1 month ago
        VCF_ANNOTATE_ALL(
            vcf_to_annotate.map{meta, vcf -> [ meta + [ file_name: vcf.baseName ], vcf ] },
            vep_fasta,
            params.tools,
            //params.snpeff_genome ? "${params.snpeff_genome}.${params.snpeff_db}" : "${params.genome}.${params.snpeff_db}",
            params.snpeff_genome ? "${params.snpeff_genome}_${params.snpeff_db}" : "${params.genome}.${params.snpeff_db}",
            snpeff_cache,
            vep_genome,
            vep_species,
            vep_cache_version,
            vep_cache,
            vep_extra_files,
            bcftools_annotations,
            bcftools_annotations_tbi,
            bcftools_header_lines)

        // Gather used softwares versions
        versions = versions.mix(VCF_ANNOTATE_ALL.out.versions)
        reports = reports.mix(VCF_ANNOTATE_ALL.out.reports)
    }
}
asp8200 commented 1 month ago

Very strange 🤔

Could you just - as a sanity check - try something like

        VCF_ANNOTATE_ALL(
            vcf_to_annotate.map{meta, vcf -> [ meta + [ file_name: vcf.baseName ], vcf ] },
            vep_fasta,
            params.tools,
            "foobar",
            snpeff_cache,
            vep_genome,
            vep_species,
            vep_cache_version,
            vep_cache,
            vep_extra_files,
            bcftools_annotations,
            bcftools_annotations_tbi,
            bcftools_header_lines)

and rerun your nf-command. Then I would expect that you get an error message like

Path provided with SnpEff cache is invalid.
Make sure there is a directory named foobar in /blue/bphl-florida/dongyibo/nf-core/sarek/data./nPlease ...

If you indeed get that, then perhaps just hardcode the snpeff-argument for now, that is, try

        VCF_ANNOTATE_ALL(
            vcf_to_annotate.map{meta, vcf -> [ meta + [ file_name: vcf.baseName ], vcf ] },
            vep_fasta,
            params.tools,
            "_candida_auris_gca_001189475",
            snpeff_cache,
            vep_genome,
            vep_species,
            vep_cache_version,
            vep_cache,
            vep_extra_files,
            bcftools_annotations,
            bcftools_annotations_tbi,
            bcftools_header_lines)

Given the problems your facing with getting this to work for "Candian aruis", I think we might try to improve on the setup of the snpeff-cache (and possibly the vep-cache) in Sarek in a future release.

asp8200 commented 1 month ago

@ybdong919 : Okay, so I've been trying to reproduce the error and it turns out that you also need to change a dot to an underscore here.

asp8200 commented 1 month ago

@ybdong919 : On this dev-branch, I tried solving the problem you encountered by retiring snpeff_genome and just using snpeff_db. Could you try to test my branch on your Candida Auris data? Cheers

ybdong919 commented 1 month ago

s.snpeff_db}" : "${params.genom

It did not work. See below:

Command:

nextflow run ./sarek -profile singularity --input samplesheet.csv --outdir ./Candida --tools 'snpeff' --step annotate --genome null --igenomes_ignore --skip_tools baserecalibrator --snpeff_cache //nf-core/sarek/data --snpeff_db _candida_aurisgca_001189475

Output of terminal: N E X T F L O W ~ version 24.04.4

Launching ./sarek/main.nf [extravagant_stallman] DSL2 - revision: e3d6110e17

WARN: Access to undefined parameter monochromeLogs -- Initialise it to a default value eg. params.monochromeLogs = some_value


                                    ,--./,-.
    ___     __   __   __   ___     /,-._.--~'

|\ | | / / \ |__) |__ } { | \| | \__, \__/ | \ |___ \-.,--, .,._,'


.´ _  `.

/ |`-_ \ _
| | \ -| |__ /\ |) | |/ \ | \ / .| /¯¯\ | \ |_
| \ `|____\´

nf-core/sarek v3.4.4

Core Nextflow options runName : extravagant_stallman containerEngine : singularity launchDir : /blue/bphl-florida/dongyibo/nf-core/sarek workDir : /blue/bphl-florida/dongyibo/nf-core/sarek/work projectDir : /blue/bphl-florida/dongyibo/nf-core/sarek/sarek userName : dongyibo profile : singularity configFiles :

Input/output options input : samplesheet.csv step : annotate outdir : ./Candida

Main options tools : snpeff skip_tools : baserecalibrator

Reference genome options genome : null snpeff_db : _candida_aurisgca_001189475 igenomes_ignore : true snpeff_cache : /blue/bphl-florida/dongyibo/nf-core/sarek/data

Generic options validationLenientMode: true

!! Only displaying parameters that differ from the pipeline defaults !!

If you use nf-core/sarek for your analysis please cite:

asp8200 commented 1 month ago

@ybdong919 : Could you try running your command using the version of Sarek from my branch modify_snpeff_cache_setup?

ybdong919 commented 1 month ago

Yes, I did. but not work. I think more places need to be debugged.

asp8200 commented 1 month ago

Hmmm ... That is strange 🤔 @maxulysse just merged my PR to the dev-branch of the nf-core/sarek-repo, and I just ran the following test on that branch:

nextflow run main.nf -profile singularity --input tests/csv/3.0/vcf_single.csv --outdir results --tools snpeff --step annotate --genome null --igenomes_ignore --snpeff_cache /home/ubuntu/dev/sarek/cache/snpeff_cache --snpeff_db _candida_auris_gca_001189475

Here the folder /home/ubuntu/dev/sarek/cache/snpeff_cache contains the subfolder _candida_auris_gca_001189475.

Works fine for me:

image

And it produces test_snpEff.ann.vcf.gz, which is placed in results/annotation/test, and the vcf contains the line

##SnpEffCmd="SnpEff  _candida_auris_gca_001189475 -csvStats test_snpEff.csv test.vcf.gz "
ybdong919 commented 3 weeks ago

How and where should the pipeline scripts be changed if I want to use it to analyze other microbe genomes?

ybdong919 commented 3 weeks ago

Hmmm ... That is strange 🤔 @maxulysse just merged my PR to the dev-branch of the nf-core/sarek-repo, and I just ran the following test on that branch:

nextflow run main.nf -profile singularity --input tests/csv/3.0/vcf_single.csv --outdir results --tools snpeff --step annotate --genome null --igenomes_ignore --snpeff_cache /home/ubuntu/dev/sarek/cache/snpeff_cache --snpeff_db _candida_auris_gca_001189475

Here the folder /home/ubuntu/dev/sarek/cache/snpeff_cache contains the subfolder _candida_auris_gca_001189475.

Works fine for me:

image

And it produces test_snpEff.ann.vcf.gz, which is placed in results/annotation/test, and the vcf contains the line

##SnpEffCmd="SnpEff  _candida_auris_gca_001189475 -csvStats test_snpEff.csv test.vcf.gz "

This is my running log according to your command:

N E X T F L O W ~ version 24.04.4

Launching ./sarek/main.nf [elated_fermat] DSL2 - revision: 1eeed174dc

WARN: Access to undefined parameter monochromeLogs -- Initialise it to a default value eg. params.monochromeLogs = some_value


                                    ,--./,-.
    ___     __   __   __   ___     /,-._.--~'

|\ | | / / \ |__) |__ } { | \| | \__, \__/ | \ |___ \-.,--, .,._,'


.´ _  `.

/ |`-_ \ _
| | \ -| |__ /\ |) | |/ \ | \ / .| /¯¯\ | \ |_
| \ `|____\´

nf-core/sarek v3.5.0dev

Core Nextflow options runName : elated_fermat containerEngine : singularity launchDir : /blue/bphl-florida/dongyibo/nf-core/sarek3 workDir : /blue/bphl-florida/dongyibo/nf-core/sarek3/work projectDir : /blue/bphl-florida/dongyibo/nf-core/sarek3/sarek userName : dongyibo profile : singularity configFiles :

Input/output options input : ./sarek/tests/csv/3.0/vcf_single.csv step : annotate outdir : results

Main options tools : snpeff

Reference genome options genome : null snpeff_db : _candida_auris_gca_001189475 igenomes_ignore : true snpeff_cache : /blue/bphl-florida/dongyibo/nf-core/sarek3/data/snpeff_cache

Generic options validationLenientMode: true

!! Only displaying parameters that differ from the pipeline defaults !!

If you use nf-core/sarek for your analysis please cite:

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS -

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM -

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT -

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - Plus 2 more processes waiting for tasks…

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - Plus 4 more processes waiting for tasks…

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - Plus 4 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] WARN: Singularity cache directory has not been defined -- Remote image will be stored in the path: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity -- Use the environment variable NXF_SINGULARITY_CACHEDIR to specify a different location

[- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - [- ] NFC…OTATE_SNPEFF:SNPEFF_SNPEFF | 0 of 1 Plus 3 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] WARN: Singularity cache directory has not been defined -- Remote image will be stored in the path: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity -- Use the environment variable NXF_SINGULARITY_CACHEDIR to specify a different location

executor > local (1) [- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - [80/062b1d] NFC…NPEFF:SNPEFF_SNPEFF (test) | 0 of 1 Plus 3 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] WARN: Singularity cache directory has not been defined -- Remote image will be stored in the path: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity -- Use the environment variable NXF_SINGULARITY_CACHEDIR to specify a different location

executor > local (1) [- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - [80/062b1d] NFC…NPEFF:SNPEFF_SNPEFF (test) | 0 of 1 Plus 3 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] WARN: Singularity cache directory has not been defined -- Remote image will be stored in the path: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity -- Use the environment variable NXF_SINGULARITY_CACHEDIR to specify a different location ERROR ~ Error executing process > 'NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test)'

Caused by: Process NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test) terminated with an error exit status (255)

Command executed:

snpEff \ -Xmx29491M \ _candida_auris_gca_001189475 \ -nodownload -canon -v \ -csvStats test_snpEff.csv \ -dataDir ${PWD}/snpeff_cache \ test.vcf.gz \

test_snpEff.ann.vcf

cat <<-END_VERSIONS > versions.yml "NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF": snpeff: $(echo $(snpEff -version 2>&1) | cut -f 2 -d ' ') END_VERSIONS

Command exit status: 255

Command output: (empty)

Command error: INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred 00:00:00 SnpEff version SnpEff 5.1d (build 2022-04-19 15:49), by Pablo Cingolani 00:00:00 Command: 'ann' 00:00:00 Reading configuration file 'snpEff.config'. Genome: '_candida_auris_gca_001189475' 00:00:00 Reading config file: snpEff.config 00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config 00:00:00 done 00:00:00 Reading database for genome version '_candida_auris_gca_001189475' from file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' (this might take a while) java.lang.RuntimeException: Database file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' is not compatible with this program version: Database version : '5.2' Program version : '5.1' Compatible versions : '[5.1, 5.0]' Try installing the appropriate database. at org.snpeff.serializer.MarkerSerializer.load(MarkerSerializer.java:140) at org.snpeff.snpEffect.SnpEffectPredictor.load(SnpEffectPredictor.java:52) at org.snpeff.snpEffect.Config.loadSnpEffectPredictor(Config.java:680) at org.snpeff.SnpEff.loadDb(SnpEff.java:499) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:890) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:875) at org.snpeff.SnpEff.run(SnpEff.java:1141) at org.snpeff.SnpEff.main(SnpEff.java:160) 00:00:00 Logging 00:00:01 Checking for updates... 00:00:02 Done.

Work dir: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/80/062b1debb95a343f3b3c73ce8944fe

Tip: you can replicate the issue by changing to the process work dir and entering the command bash .command.run

-- Check '.nextflow.log' file for details

executor > local (1) [- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - [80/062b1d] NFC…NPEFF:SNPEFF_SNPEFF (test) | 1 of 1, failed: 1 ✘ Plus 3 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] Execution cancelled -- Finishing pending tasks before exit Pulling Singularity image https://depot.galaxyproject.org/singularity/multiqc:1.21--pyhdfd78af_0 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-multiqc-1.21--pyhdfd78af_0.img] WARN: Singularity cache directory has not been defined -- Remote image will be stored in the path: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity -- Use the environment variable NXF_SINGULARITY_CACHEDIR to specify a different location ERROR ~ Error executing process > 'NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test)'

Caused by: Process NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test) terminated with an error exit status (255)

Command executed:

snpEff \ -Xmx29491M \ _candida_auris_gca_001189475 \ -nodownload -canon -v \ -csvStats test_snpEff.csv \ -dataDir ${PWD}/snpeff_cache \ test.vcf.gz \

test_snpEff.ann.vcf

cat <<-END_VERSIONS > versions.yml "NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF": snpeff: $(echo $(snpEff -version 2>&1) | cut -f 2 -d ' ') END_VERSIONS

Command exit status: 255

Command output: (empty)

Command error: INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred 00:00:00 SnpEff version SnpEff 5.1d (build 2022-04-19 15:49), by Pablo Cingolani 00:00:00 Command: 'ann' 00:00:00 Reading configuration file 'snpEff.config'. Genome: '_candida_auris_gca_001189475' 00:00:00 Reading config file: snpEff.config 00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config 00:00:00 done 00:00:00 Reading database for genome version '_candida_auris_gca_001189475' from file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' (this might take a while) java.lang.RuntimeException: Database file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' is not compatible with this program version: Database version : '5.2' Program version : '5.1' Compatible versions : '[5.1, 5.0]' Try installing the appropriate database. at org.snpeff.serializer.MarkerSerializer.load(MarkerSerializer.java:140) at org.snpeff.snpEffect.SnpEffectPredictor.load(SnpEffectPredictor.java:52) at org.snpeff.snpEffect.Config.loadSnpEffectPredictor(Config.java:680) at org.snpeff.SnpEff.loadDb(SnpEff.java:499) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:890) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:875) at org.snpeff.SnpEff.run(SnpEff.java:1141) at org.snpeff.SnpEff.main(SnpEff.java:160) 00:00:00 Logging 00:00:01 Checking for updates... 00:00:02 Done.

Work dir: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/80/062b1debb95a343f3b3c73ce8944fe

Tip: you can replicate the issue by changing to the process work dir and entering the command bash .command.run

-- Check '.nextflow.log' file for details ERROR ~ Pipeline failed. Please refer to troubleshooting docs: https://nf-co.re/docs/usage/troubleshooting

-- Check '.nextflow.log' file for details

executor > local (1) [- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - [80/062b1d] NFC…NPEFF:SNPEFF_SNPEFF (test) | 1 of 1, failed: 1 ✘ Plus 3 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] Execution cancelled -- Finishing pending tasks before exit Pulling Singularity image https://depot.galaxyproject.org/singularity/multiqc:1.21--pyhdfd78af_0 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-multiqc-1.21--pyhdfd78af_0.img] -[nf-core/sarek] Pipeline completed with errors- WARN: Singularity cache directory has not been defined -- Remote image will be stored in the path: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity -- Use the environment variable NXF_SINGULARITY_CACHEDIR to specify a different location ERROR ~ Error executing process > 'NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test)'

Caused by: Process NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test) terminated with an error exit status (255)

Command executed:

snpEff \ -Xmx29491M \ _candida_auris_gca_001189475 \ -nodownload -canon -v \ -csvStats test_snpEff.csv \ -dataDir ${PWD}/snpeff_cache \ test.vcf.gz \

test_snpEff.ann.vcf

cat <<-END_VERSIONS > versions.yml "NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF": snpeff: $(echo $(snpEff -version 2>&1) | cut -f 2 -d ' ') END_VERSIONS

Command exit status: 255

Command output: (empty)

Command error: INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred 00:00:00 SnpEff version SnpEff 5.1d (build 2022-04-19 15:49), by Pablo Cingolani 00:00:00 Command: 'ann' 00:00:00 Reading configuration file 'snpEff.config'. Genome: '_candida_auris_gca_001189475' 00:00:00 Reading config file: snpEff.config 00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config 00:00:00 done 00:00:00 Reading database for genome version '_candida_auris_gca_001189475' from file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' (this might take a while) java.lang.RuntimeException: Database file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' is not compatible with this program version: Database version : '5.2' Program version : '5.1' Compatible versions : '[5.1, 5.0]' Try installing the appropriate database. at org.snpeff.serializer.MarkerSerializer.load(MarkerSerializer.java:140) at org.snpeff.snpEffect.SnpEffectPredictor.load(SnpEffectPredictor.java:52) at org.snpeff.snpEffect.Config.loadSnpEffectPredictor(Config.java:680) at org.snpeff.SnpEff.loadDb(SnpEff.java:499) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:890) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:875) at org.snpeff.SnpEff.run(SnpEff.java:1141) at org.snpeff.SnpEff.main(SnpEff.java:160) 00:00:00 Logging 00:00:01 Checking for updates... 00:00:02 Done.

Work dir: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/80/062b1debb95a343f3b3c73ce8944fe

Tip: you can replicate the issue by changing to the process work dir and entering the command bash .command.run

-- Check '.nextflow.log' file for details ERROR ~ Pipeline failed. Please refer to troubleshooting docs: https://nf-co.re/docs/usage/troubleshooting

-- Check '.nextflow.log' file for details

executor > local (1) [- ] NFC…EPARE_GENOME:BWAMEM1_INDEX - [- ] NFC…EPARE_GENOME:BWAMEM2_INDEX - [- ] NFC…E_GENOME:DRAGMAP_HASHTABLE - [- ] NFC…4_CREATESEQUENCEDICTIONARY - [- ] NFC…E_GENOME:MSISENSORPRO_SCAN - [- ] NFC…PARE_GENOME:SAMTOOLS_FAIDX - [- ] NFC…TABIX_BCFTOOLS_ANNOTATIONS - [- ] NFC…PREPARE_GENOME:TABIX_DBSNP - [- ] NFC…ME:TABIX_GERMLINE_RESOURCE - [- ] NFC…RE_GENOME:TABIX_KNOWN_SNPS - [- ] NFC…_GENOME:TABIX_KNOWN_INDELS - [- ] NFC…K:PREPARE_GENOME:TABIX_PON - [- ] NFC…ZIPTABIX_INTERVAL_COMBINED - [- ] NFC…AM_QC_RECAL:SAMTOOLS_STATS - [- ] NFC…EQC:CRAM_QC_RECAL:MOSDEPTH - [- ] NFC…CHECKMATE:BCFTOOLS_MPILEUP - [- ] NFC…CHECKMATE:NGSCHECKMATE_NCM - [- ] NFC…LS_VCFTOOLS:BCFTOOLS_STATS - [- ] NFC…FTOOLS:VCFTOOLS_TSTV_COUNT - [- ] NFC…CFTOOLS:VCFTOOLS_TSTV_QUAL - [80/062b1d] NFC…NPEFF:SNPEFF_SNPEFF (test) | 1 of 1, failed: 1 ✘ Plus 3 more processes waiting for tasks… Pulling Singularity image https://depot.galaxyproject.org/singularity/snpeff:5.1--hdfd78af_2 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-snpeff-5.1--hdfd78af_2.img] Execution cancelled -- Finishing pending tasks before exit Pulling Singularity image https://depot.galaxyproject.org/singularity/multiqc:1.21--pyhdfd78af_0 [cache /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity/depot.galaxyproject.org-singularity-multiqc-1.21--pyhdfd78af_0.img] -[nf-core/sarek] Pipeline completed with errors- WARN: Singularity cache directory has not been defined -- Remote image will be stored in the path: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/singularity -- Use the environment variable NXF_SINGULARITY_CACHEDIR to specify a different location ERROR ~ Error executing process > 'NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test)'

Caused by: Process NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF (test) terminated with an error exit status (255)

Command executed:

snpEff \ -Xmx29491M \ _candida_auris_gca_001189475 \ -nodownload -canon -v \ -csvStats test_snpEff.csv \ -dataDir ${PWD}/snpeff_cache \ test.vcf.gz \

test_snpEff.ann.vcf

cat <<-END_VERSIONS > versions.yml "NFCORE_SAREK:SAREK:VCF_ANNOTATE_ALL:VCF_ANNOTATE_SNPEFF:SNPEFF_SNPEFF": snpeff: $(echo $(snpEff -version 2>&1) | cut -f 2 -d ' ') END_VERSIONS

Command exit status: 255

Command output: (empty)

Command error: INFO: Environment variable SINGULARITYENV_TMPDIR is set, but APPTAINERENV_TMPDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_TASK_WORKDIR is set, but APPTAINERENV_NXF_TASK_WORKDIR is preferred INFO: Environment variable SINGULARITYENV_NXF_DEBUG is set, but APPTAINERENV_NXF_DEBUG is preferred 00:00:00 SnpEff version SnpEff 5.1d (build 2022-04-19 15:49), by Pablo Cingolani 00:00:00 Command: 'ann' 00:00:00 Reading configuration file 'snpEff.config'. Genome: '_candida_auris_gca_001189475' 00:00:00 Reading config file: snpEff.config 00:00:00 Reading config file: /usr/local/share/snpeff-5.1-2/snpEff.config 00:00:00 done 00:00:00 Reading database for genome version '_candida_auris_gca_001189475' from file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' (this might take a while) java.lang.RuntimeException: Database file 'snpeff_cache/_candida_auris_gca_001189475/snpEffectPredictor.bin' is not compatible with this program version: Database version : '5.2' Program version : '5.1' Compatible versions : '[5.1, 5.0]' Try installing the appropriate database. at org.snpeff.serializer.MarkerSerializer.load(MarkerSerializer.java:140) at org.snpeff.snpEffect.SnpEffectPredictor.load(SnpEffectPredictor.java:52) at org.snpeff.snpEffect.Config.loadSnpEffectPredictor(Config.java:680) at org.snpeff.SnpEff.loadDb(SnpEff.java:499) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:890) at org.snpeff.snpEffect.commandLine.SnpEffCmdEff.run(SnpEffCmdEff.java:875) at org.snpeff.SnpEff.run(SnpEff.java:1141) at org.snpeff.SnpEff.main(SnpEff.java:160) 00:00:00 Logging 00:00:01 Checking for updates... 00:00:02 Done.

Work dir: /blue/bphl-florida/dongyibo/nf-core/sarek3/work/80/062b1debb95a343f3b3c73ce8944fe

Tip: you can replicate the issue by changing to the process work dir and entering the command bash .command.run

-- Check '.nextflow.log' file for details ERROR ~ Pipeline failed. Please refer to troubleshooting docs: https://nf-co.re/docs/usage/troubleshooting

-- Check '.nextflow.log' file for details

ybdong919 commented 3 weeks ago

Would you share the version 5.0 or 5.1 of Candida auris? The two versions are not available when I try to download them by "snpeff --download Candida auris" :

[dongyibo@login9 nf-core]$ snpeff -download Candida_auris 00:00:00 ERROR while connecting to https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_Candida_auris.zip 00:00:00 ERROR while connecting to https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_Candida_auris.zip FATAL ERROR: Failed to download database from [https://snpeff.blob.core.windows.net/databases/v5_1/snpEff_v5_1_Candida_auris.zip, https://snpeff.blob.core.windows.net/databases/v5_0/snpEff_v5_0_Candida_auris.zip]