genome-in-a-bottle / giab_latest_release

This repository contains information about the latest release from the Genome in a Bottle project

VCF INFO datasetsmissingcall should be Type=String #3

heuermh closed this issue 3 years ago

heuermh commented 7 years ago

The file

HG001_GRCh38_GIAB_highconf_CG-IllFB-IllGATKHC-Ion-10X-SOLID_CHROM1-X_v.3.3.2_all.vcf.gz

contains the header line

##INFO=<ID=datasetsmissingcall,Number=.,Type=Integer,Description="Names of datasets that are missing a call or have an incorrect call at this location, and the high-confidence call is a variant">

The description ("Names of...") and Type=Integer do not match, and INFO field values such as

datasetsmissingcall=10XChromium,IonExome,SolidPE50x50bp,SolidSE75bp

throw NumberFormatException: For input string: "10XChromium" in our application.
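
For anyone who wants to check a header for this kind of mismatch before parsing, here is a minimal sketch in Python. It is not the code from our application: the helper names `parse_info_header` and `values_match_type` are hypothetical, the regular expression assumes the simple `ID, Number, Type, Description` layout shown above (no extra `Source` or `Version` keys), and only the `Integer` type is checked.

```python
import re

# Hypothetical helpers for illustration; not part of any VCF library.
# Assumes an INFO header with exactly ID, Number, Type, Description, in that order.
INFO_HEADER_RE = re.compile(
    r'^##INFO=<ID=(?P<id>[^,]+),Number=(?P<number>[^,]+),'
    r'Type=(?P<type>[^,]+),Description="(?P<description>.*)">$'
)
INTEGER_RE = re.compile(r'[+-]?[0-9]+')


def parse_info_header(line):
    """Return the ID, Number, Type and Description of one ##INFO header line."""
    match = INFO_HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("not a simple ##INFO header line: " + line)
    return match.groupdict()


def values_match_type(declared_type, raw_value):
    """Check a comma-separated INFO value against the declared Type.

    Only Type=Integer is validated here; every other type is accepted as-is.
    A lone "." is treated as a missing value.
    """
    if declared_type != "Integer":
        return True
    return all(
        item == "." or INTEGER_RE.fullmatch(item) is not None
        for item in raw_value.split(",")
    )


header = (
    '##INFO=<ID=datasetsmissingcall,Number=.,Type=Integer,'
    'Description="Names of datasets that are missing a call or have an '
    'incorrect call at this location, and the high-confidence call is a variant">'
)
value = "10XChromium,IonExome,SolidPE50x50bp,SolidSE75bp"

info = parse_info_header(header)
if not values_match_type(info["type"], value):
    print("INFO field", info["id"], "is declared", info["type"],
          "but holds non-integer values:", value)
```

With the header and value quoted above, the check reports a mismatch; with Type=String it would pass.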

jzook commented 3 years ago

This has been corrected in v4.2.1. Thanks for reporting!

heuermh commented 3 years ago

Thank you, @jzook! I'll take a look at the latest release.