Closed wdermauw closed 4 years ago
@nalinigans @mlathara your thoughts on this issue?
@wdermauw Did you confirm prior to the failed GenotypeGVCFs
run that the ABchroneALL
directory existed in the working directory, was readable, non-empty, etc.? The error you are getting is a very generic I/O error, so it's difficult to diagnose the problem without a bit more information.
@wdermauw Can you paste the contents of /data/xxxxxx/ABchroneALL/callset.json
. This error seems to suggest something is amiss with that...maybe one of the sample names was somehow mangled...
Caused by: java.io.IOException: GenomicsDB JNI Error: GenomicsDBConfigException : Syntax error in JSON file /data/xxxxxx/ABchroneALL/callset.json
@mlathara @droazen thank you for the quick reply...
i checked the callset.json and you are correct, sample names were somehow mingled...my sincere apologies
after the mapping/variant calling (using a pipeline), I noticed that two g.vcf.gz files had a space in their filename. I removed the space in the file name (X g.vcf.gz->X.g.vcf.gz) and continued with GenomicsDB, but apparently, GenomicsDb extracts the sample name not from the file name but from somewhere else (as in the callset.json file I found that the file mentioned above was called as X� and X�_stream, while this was not the case for all other files)
not using the vcf.gz files having a � symbol in callset.json solved the problem (GenotypeGVCF not throwing an error).
thank you for helping out again, my apologies for the inconvenience caused
Closing as resolved
i have meet the same question . l want to know how to cue this question ? my sincere apologies
Closing as resolved
i have meet the same question . l want to know how to cue this question ? my sincere apologies
Hi
After creating a GenomicsDB with 36 gvcf withGenomicsDBimport, I used GEnotypeGVCF on the GenomicsDB folder but get the following error: "ERROR: Couldn't create GenomicsDBFeatureReader". See below for the complete output.
FYI, I don't get this error when importing only six samples in GenomicsDB and then running GenotypeGVCF.
FYI, also added "TILEDB_DISABLE_FILE_LOCKING=1" to environment (export TILEDB_DISABLE_FILE_LOCKING=1) before executing GenomicsDBimport command but that dit not help either.
Any suggestion would be highly appreciated.
GenotypeVCF not working on 36 samples
(base) xxxxxx@galaxy:~$ gatk --java-options "-Xmx30g" GenomicsDBImport \
(base) xxxxxx@galaxy:~$ gatk --java-options "-Xmx30g -DGATK_STACKTRACE_ON_USER_EXCEPTION=true" GenotypeGVCFs -R Reference/File_S16_uT_3_Pseudochromosomes.fasta -V gendb://ABchroneALL -O ABchroneALL.vcf.gz Using GATK jar /data/xxxxxx/miniconda3/share/gatk4-4.1.6.0-0/gatk-package-4.1.6.0-local.jar Running: java -Dsamjdk.use_async_io_read_samtools=false -Dsamjdk.use_async_io_write_samtools=true -Dsamjdk.use_async_io_write_tribble=false -Dsamjdk.compression_level=2 -Xmx30g -DGATK_STACKTRACE_ON_USER_EXCEPTION=true -jar /data/xxxxxx/miniconda3/share/gatk4-4.1.6.0-0/gatk-package-4.1.6.0-local.jar GenotypeGVCFs -R Reference/File_S16_uT_3_Pseudochromosomes.fasta -V gendb://ABchroneALL -O ABchroneALL.vcf.gz 09:48:14.426 INFO NativeLibraryLoader - Loading libgkl_compression.so from jar:file:/data/xxxxxx/miniconda3/share/gatk4-4.1.6.0-0/gatk-package-4.1.6.0-local.jar!/com/intel/gkl/native/libgkl_compression.so May 27, 2020 9:48:14 AM shaded.cloud_nio.com.google.auth.oauth2.ComputeEngineCredentials runningOnComputeEngine INFO: Failed to detect whether we are running on Google Compute Engine. 09:48:14.871 INFO GenotypeGVCFs - ------------------------------------------------------------ 09:48:14.871 INFO GenotypeGVCFs - The Genome Analysis Toolkit (GATK) v4.1.6.0 09:48:14.872 INFO GenotypeGVCFs - For support and documentation go to https://software.broadinstitute.oAB/gatk/ 09:48:14.872 INFO GenotypeGVCFs - Executing as xxxxxx@galaxy on Linux v4.4.0-133-generic amd64 09:48:14.872 INFO GenotypeGVCFs - Java runtime: OpenJDK 64-Bit Server VM v1.8.0_192-b01 09:48:14.872 INFO GenotypeGVCFs - Start Date/Time: 27 May 2020 09:48:14 CEST 09:48:14.872 INFO GenotypeGVCFs - ------------------------------------------------------------ 09:48:14.872 INFO GenotypeGVCFs - ------------------------------------------------------------ 09:48:14.873 INFO GenotypeGVCFs - HTSJDK Version: 2.21.2 09:48:14.873 INFO GenotypeGVCFs - Picard Version: 2.21.9 09:48:14.873 INFO GenotypeGVCFs - HTSJDK Defaults.COMPRESSION_LEVEL : 2 09:48:14.874 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_READ_FOR_SAMTOOLS : false 09:48:14.874 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_SAMTOOLS : true 09:48:14.874 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_TRIBBLE : false 09:48:14.874 INFO GenotypeGVCFs - Deflater: IntelDeflater 09:48:14.874 INFO GenotypeGVCFs - Inflater: IntelInflater 09:48:14.874 INFO GenotypeGVCFs - GCS max retries/reopens: 20 09:48:14.874 INFO GenotypeGVCFs - Requester pays: disabled 09:48:14.874 INFO GenotypeGVCFs - Initializing engine 09:48:16.015 INFO GenotypeGVCFs - Shutting down engine [27 May 2020 09:48:16 CEST] oAB.broadinstitute.hellbender.tools.walkers.GenotypeGVCFs done. Elapsed time: 0.03 minutes. Runtime.totalMemory()=2301100032
A USER ERROR has occurred: Couldn't create GenomicsDBFeatureReader
oAB.broadinstitute.hellbender.exceptions.UserException: Couldn't create GenomicsDBFeatureReader at oAB.broadinstitute.hellbender.engine.FeatureDataSource.getGenomicsDBFeatureReader(FeatureDataSource.java:410) at oAB.broadinstitute.hellbender.engine.FeatureDataSource.getFeatureReader(FeatureDataSource.java:326) at oAB.broadinstitute.hellbender.engine.FeatureDataSource.(FeatureDataSource.java:282)
at oAB.broadinstitute.hellbender.engine.VariantLocusWalker.initializeDrivingVariants(VariantLocusWalker.java:76)
at oAB.broadinstitute.hellbender.engine.VariantWalkerBase.initializeFeatures(VariantWalkerBase.java:67)
at oAB.broadinstitute.hellbender.engine.GATKTool.onStartup(GATKTool.java:706)
at oAB.broadinstitute.hellbender.engine.VariantLocusWalker.onStartup(VariantLocusWalker.java:63)
at oAB.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:137)
at oAB.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseAABs(CommandLineProgram.java:191)
at oAB.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:210)
at oAB.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:163)
at oAB.broadinstitute.hellbender.Main.mainEntry(Main.java:206)
at oAB.broadinstitute.hellbender.Main.main(Main.java:292)
Caused by: java.io.IOException: GenomicsDB JNI Error: GenomicsDBConfigException : Syntax error in JSON file /data/xxxxxx/ABchroneALL/callset.json
at oAB.genomicsdb.reader.GenomicsDBQueryStream.jniGenomicsDBInit(Native Method)
at oAB.genomicsdb.reader.GenomicsDBQueryStream.(GenomicsDBQueryStream.java:209)
at oAB.genomicsdb.reader.GenomicsDBQueryStream.(GenomicsDBQueryStream.java:182)
at oAB.genomicsdb.reader.GenomicsDBQueryStream.(GenomicsDBQueryStream.java:91)
at oAB.genomicsdb.reader.GenomicsDBFeatureReader.generateHeadersForQuery(GenomicsDBFeatureReader.java:176)
at oAB.genomicsdb.reader.GenomicsDBFeatureReader.(GenomicsDBFeatureReader.java:80)
at oAB.broadinstitute.hellbender.engine.FeatureDataSource.getGenomicsDBFeatureReader(FeatureDataSource.java:407)
... 12 more
GenotypeGVCF working when only six samples are imported in GenomicsDB
gatk GenotypeGVCFs -R Reference/File_S16_uT_chromosomes.fasta -V gendb://GenomicsDB_wd -O test_chromosome_1_6_samples.vcf Using GATK jar /data/xxxxx/miniconda3/share/gatk4-4.1.6.0-0/gatk-package-4.1.6.0-local.jar Running: java -Dsamjdk.use_async_io_read_samtools=false -Dsamjdk.use_async_io_write_samtools=true -Dsamjdk.use_async_io_write_tribble=false -Dsamjdk.compression_level=2 -jar /data/xxxxx/miniconda3/share/gatk4-4.1.6.0-0/gatk-package-4.1.6.0-local.jar GenotypeGVCFs -R Reference/File_S16_Tetranychus_urticae_3_chromosomes.fasta -V gendb://GenomicsDB_wd -O test_chromosome_1_6_samples.vcf 21:14:37.817 INFO NativeLibraryLoader - Loading libgkl_compression.so from jar:file:/data/xxxxx/miniconda3/share/gatk4-4.1.6.0-0/gatk-package-4.1.6.0-local.jar!/com/intel/gkl/native/libgkl_compression.so May 26, 2020 9:14:38 PM shaded.cloud_nio.com.google.auth.oauth2.ComputeEngineCredentials runningOnComputeEngine INFO: Failed to detect whether we are running on Google Compute Engine. 21:14:38.245 INFO GenotypeGVCFs - ------------------------------------------------------------ 21:14:38.246 INFO GenotypeGVCFs - The Genome Analysis Toolkit (GATK) v4.1.6.0 21:14:38.246 INFO GenotypeGVCFs - For support and documentation go to https://software.broadinstitute.org/gatk/ 21:14:38.246 INFO GenotypeGVCFs - Executing as xxxxx@galaxy on Linux v4.4.0-133-generic amd64 21:14:38.246 INFO GenotypeGVCFs - Java runtime: OpenJDK 64-Bit Server VM v1.8.0_192-b01 21:14:38.246 INFO GenotypeGVCFs - Start Date/Time: 26 May 2020 21:14:37 CEST 21:14:38.246 INFO GenotypeGVCFs - ------------------------------------------------------------ 21:14:38.246 INFO GenotypeGVCFs - ------------------------------------------------------------ 21:14:38.247 INFO GenotypeGVCFs - HTSJDK Version: 2.21.2 21:14:38.247 INFO GenotypeGVCFs - Picard Version: 2.21.9 21:14:38.247 INFO GenotypeGVCFs - HTSJDK Defaults.COMPRESSION_LEVEL : 2 21:14:38.247 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_READ_FOR_SAMTOOLS : false 21:14:38.247 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_SAMTOOLS : true 21:14:38.247 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_TRIBBLE : false 21:14:38.247 INFO GenotypeGVCFs - Deflater: IntelDeflater 21:14:38.248 INFO GenotypeGVCFs - Inflater: IntelInflater 21:14:38.248 INFO GenotypeGVCFs - GCS max retries/reopens: 20 21:14:38.248 INFO GenotypeGVCFs - Requester pays: disabled 21:14:38.248 INFO GenotypeGVCFs - Initializing engine WARNING: No valid combination operation found for INFO field AC - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field AF - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field AN - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field FS - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field InbreedingCoeff - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field MLEAC - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field MLEAF - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field QD - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field SOR - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field AC - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field AF - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field AN - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field FS - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field InbreedingCoeff - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field MLEAC - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field MLEAF - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field QD - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field SOR - the field will NOT be part of INFO fields in the generated VCF records 21:14:39.358 INFO GenotypeGVCFs - Done initializing engine 21:14:39.430 INFO ProgressMeter - Starting traversal 21:14:39.431 INFO ProgressMeter - Current Locus Elapsed Minutes Variants Processed Variants/Minute WARNING: No valid combination operation found for INFO field AC - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field AF - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field AN - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field FS - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field InbreedingCoeff - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field MLEAC - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field MLEAF - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field QD - the field will NOT be part of INFO fields in the generated VCF records WARNING: No valid combination operation found for INFO field SOR - the field will NOT be part of INFO fields in the generated VCF records 21:14:44.408 WARN InbreedingCoeff - InbreedingCoeff will not be calculated; at least 10 samples must have called genotypes 21:14:49.472 INFO ProgressMeter - chromosome_1:220314 0.2 39000 233044.5 21:14:59.558 INFO ProgressMeter - chromosome_1:712661 0.3 174000 518732.0 21:15:09.572 INFO ProgressMeter - chromosome_1:1268793 0.5 295000 587240.0 21:15:19.634 INFO ProgressMeter - chromosome_1:1772603 0.7 406000 605924.9 21:15:29.676 INFO ProgressMeter - chromosome_1:2224562 0.8 491000 586327.0 21:15:39.688 INFO ProgressMeter - chromosome_1:2797827 1.0 590000 587483.6 21:15:49.732 INFO ProgressMeter - chromosome_1:3283434 1.2 692000 590611.7 21:15:59.769 INFO ProgressMeter - chromosome_1:3697367 1.3 786000 587027.1 21:16:09.811 INFO ProgressMeter - chromosome_1:4258637 1.5 905000 600796.6 21:16:19.812 INFO ProgressMeter - chromosome_1:4679796 1.7 997000 595929.5 21:16:29.892 INFO ProgressMeter - chromosome_1:5272909 1.8 1123000 609989.0 21:16:39.897 INFO ProgressMeter - chromosome_1:5829289 2.0 1231000 613119.1 21:16:49.935 INFO ProgressMeter - chromosome_1:6383684 2.2 1334000 613314.5 21:16:59.961 INFO ProgressMeter - chromosome_1:6997412 2.3 1435000 612680.6 21:17:11.048 INFO ProgressMeter - chromosome_1:7697582 2.5 1544000 611013.3 21:17:21.141 INFO ProgressMeter - chromosome_1:8295832 2.7 1641000 608867.7 21:17:31.202 INFO ProgressMeter - chromosome_1:9129097 2.9 1754000 612676.2 21:17:41.235 INFO ProgressMeter - chromosome_1:10085011 3.0 1874000 618468.2 21:17:51.251 INFO ProgressMeter - chromosome_1:10960634 3.2 1987000 621520.2