The following VEP fields are observed as IntegerType for SNPs, but for indels they can be StringType:
"cDNA_position" -> IntegerType, "CDS_position" -> IntegerType, "Protein_position" -> IntegerType
This is because indels are specified as a range of positions, for example "48-49".
As a consequence, any VCF rows with indels will have their VEP INFO fields read in as "null".
https://github.com/projectglow/glow/blob/8b0bcd6b2f7320c3a5bd186bdcfa4707af303b47/core/src/main/scala/io/projectglow/vcf/AnnotationUtils.scala#L58
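
To illustrate the mismatch, here is a minimal sketch in plain Python (not Glow's Scala code, and not a proposed patch) of treating these position fields as strings and splitting them into a start and end. The helper name `parse_vep_position` and the choice to return `None` for anything non-numeric are assumptions made for illustration only.

```python
from typing import Optional, Tuple


def parse_vep_position(value: Optional[str]) -> Optional[Tuple[int, int]]:
    """Parse a VEP position such as "48" or "48-49" into (start, end).

    Hypothetical helper: a single position is returned as (n, n), and
    missing or non-numeric input is returned as None.
    """
    if value is None or value.strip() == "":
        return None
    # partition() returns (before, separator, after); separator is ""
    # when the value contains no "-", i.e. a single SNP-style position.
    start, sep, end = value.strip().partition("-")
    try:
        first = int(start)
        last = int(end) if sep else first
    except ValueError:
        return None
    return (first, last)


print(parse_vep_position("48"))     # SNP-style single position
print(parse_vep_position("48-49"))  # indel-style range
```

With this approach a value like "48-49" is kept as a range instead of failing an integer cast, so the indel row would not lose the rest of its annotation.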