Closed JbaptisteLam closed 2 months ago
This is probably due to not well-formed genotypes on samples columns. Indeed, input file format may contains a mix of columns, not only in VCF format (such as a TSV file), to include columns that are annotations (or whatever). Identify sample columns (those with genotypes) is tricky. Moreover, to ensure correct calculations on genotypes (e.g. trio calculation), format should be well formed.
A fix had been done (#263) to be more flexible on genotype format, and an option had been added to ensure that the list of sample is well-defined.
Try:
bcftools view input.vcf -o input.checked.vcf
).
HOWARD (last devel version) installed with pip in a conda env
I'm trying to annotate a multi sample vcf from WGS analysis with a huge database in parquet. Annotation goes well but there are no sample column anymore in the output file (the last one in mandatory column is FORMAT), Moreover, I can't reproduce this behaviour with only a few variants from my cohort...
Best,
Jean-Baptiste