nf-core / eager

A fully reproducible and state-of-the-art ancient DNA analysis pipeline
https://nf-co.re/eager
MIT License
148 stars 81 forks source link

vcf2genome error #1086

Open dbajpp0 opened 2 months ago

dbajpp0 commented 2 months ago

Dear Eager developer, I found an error on converting vcf file from GATK ug with vcf2geome. Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 3280 at VCF2Genome.runUGAnalysis(VCF2Genome.java:284) at VCF2Genome.(VCF2Genome.java:148) at VCF2Genome.main(VCF2Genome.java:129) I send full error and nextflow log below. I also added info about OS, eager and nextflow as well as full command line input Could you pleae help me to sort it out? Thanks in advance, Joan

Check Documentation

I have checked the following places for your error:

Description of the bug

Pipeline completed with errors- Error executing process > 'vcf2genome (50lib_myotragus_reads_clean_mapped)'

Caused by: Process vcf2genome (50lib_myotragus_reads_clean_mapped) terminated with an error exit status (1)

Command executed:

pigz -d -f -p 1 50lib_myotragus_reads_clean_mapped.unifiedgenotyper.vcf.gz vcf2genome -Xmx100g -draft 50lib_myotragus_reads_clean_mapped_UGVCFcsus.fasta -draftname "csusUG" -in 50lib_myotragus_reads_clean_mapped.unifiedgenotyper.vcf -minc 5 -minfreq 0.8 -minq 20 -ref Ovis_RNA_centroids95_part1.fa -refMod 50lib_myotragus_reads_clean_mapped_UGVCFcsus.fasta_refmod.fasta -uncertain 50lib_myotragus_reads_clean_mapped_UGVCFcsus.fasta_uncertainty.fasta pigz -f -p 1 50lib_myotragus_reads_clean_mapped_UGVCFcsus.fasta bgzip -@ 1 .vcf

Command exit status: 1

Command output: VCF2Genome (v. 0.91 2017-01-13) by Alexander Herbig (<= v0.84) and Alexander Peltzer (>v0.84) herbig@shh.mpg.de, peltzer@shh.mpg.de

SNP statistics Quality Threshold: 20.0 Coverage Threshold: 5 Minimum SNP allele frequency: 0.8 sample SNP Calls coverage(fold) coverage(percent) refCall allPos noCall discardedRefCall discardedVarCall filteredVarCall unhandledGenotype

Command error: VCF2Genome (v. 0.91 2017-01-13) by Alexander Herbig (<= v0.84) and Alexander Peltzer (>v0.84) herbig@shh.mpg.de, peltzer@shh.mpg.de

SNP statistics Quality Threshold: 20.0 Coverage Threshold: 5 Minimum SNP allele frequency: 0.8 sample SNP Calls coverage(fold) coverage(percent) refCall allPos noCall discardedRefCall discardedVarCall filteredVarCall unhandledGenotype Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 3280 at VCF2Genome.runUGAnalysis(VCF2Genome.java:284) at VCF2Genome.(VCF2Genome.java:148) at VCF2Genome.main(VCF2Genome.java:129)

Work dir: /dades/jpp/data_WG_myotragus/eager/part1/work/5c/ab0ab35e55577260d159fd471ea82f

Tip: you can replicate the issue by changing to the process work dir and entering the command bash .command.run

Steps to reproduce

Steps to reproduce the behaviour:

  1. Command line: nextflow run ...
  2. See error: Please provide your error message

Expected behaviour

Log files

Have you provided the following extra information/files:

System

Nextflow Installation

Container engine

Additional context

jfy133 commented 1 month ago

I'm very very sorry @dbajpp0 this completely slipped out of my TODO list

Unfortunately this appears to be an error with VCF2Genome.

I would suggest going into the working directory (if you still have the run /dades/jpp/data_WG_myotragus/eager/part1/work/5c/ab0ab35e55577260d159fd471ea82f) and send all the files there and the command eager is trying to execute (vcf2genome -Xmx100g -draft 50lib_myotragus_reads_clean_mapped_UGVCFcsus.fasta -draftname "csusUG" -in 50lib_myotragus_reads_clean_mapped.unifiedgenotyper.vcf -minc 5 -minfreq 0.8 -minq 20 -ref Ovis_RNA_centroids95_part1.fa -refMod 50lib_myotragus_reads_clean_mapped_UGVCFcsus.fasta_refmod.fasta -uncertain 50lib_myotragus_reads_clean_mapped_UGVCFcsus.fasta_uncertainty.fasta) to the developer (alexander_herbig@eva.mpg.de).

Altternatively you could try MultiVCFAnalyzer which serves the same purpose but with extra options