rki-mf1 / CoVpipe2

SARS-CoV-2 genome reconstruction for Illumina data in Nextflow
GNU General Public License v3.0
3 stars 1 forks source link

check bcftools consensus generation / ambiguity calls #41

Closed MarieLataretu closed 2 years ago

MarieLataretu commented 2 years ago

Current parameter: bcftools consensus [...] --sample {name} -I VCF with FORMAT/GT and exactly one sample

Differences -I/-H I:

(https://github.com/samtools/bcftools/issues/1708#issuecomment-1114804855, https://github.com/samtools/bcftools/issues/1400#issuecomment-777286784)

Should we use --sample {name} -H I instead of --sample {name} -I, because FORMAT/GT is not considered?


Related issues: https://github.com/samtools/bcftools/issues/1516 https://github.com/samtools/bcftools/issues/1400 https://github.com/samtools/bcftools/issues/1708

Manual: http://samtools.github.io/bcftools/bcftools.html#consensus

MarieLataretu commented 2 years ago

Test results:

no FORMAT/GT, no --sample FORMAT/GT, no --sample FORMAT/GT, --sample
-I IUPAC REF + all ALT IUPAC REF + all ALT IPUAC GT *)
-H I REF na IPUAC GT *)

*) of respective sample

-> -I/-H I makes no difference, if there is also --sample {name}

I'd rather switch to -H I though, because it's more explicit for what we want