Closed felipeiani closed 4 months ago
In order to keep the program relatively simple, consensus
merely applies variants from the VCF regardless of their depth, VAF, AF, or other properties. It does support the -i/-e
filtering options, but if more more advanced logic is required, the VCF has to be preprocessed with other tools, such as the +setGT plugin (http://samtools.github.io/bcftools/howtos/plugin.setGT.html). I suggest you use that to enforce the desired genotype.
To cross reference, this issue is similar to #1488 or https://github.com/samtools/bcftools/issues/1516#issuecomment-875179854.
For genomic analyses, primarily but not exclusively viral, we have been using a pipeline called Viralunity (https://github.com/filiperomero2/ViralUnity). However, I have noticed that depending on the reference used, the results vary significantly. For example, at some sites, I may have the nucleotide "a" sequenced at a depth of 1000X and "c" only 5X. If the reference is "c", the consensus will come out as "C". The command we are using within the script is:
Is that so? Is there any additional command we could add to set a minimum depth or frequency? Or perhaps specify that the consensus should have a minimum proportion, such as 70% for the most frequent base and 30% for the least frequent, to be considered true?
Sinceraly,