bcftools consensus with deletions starting outside region of interest #2091

nage0178 commented 5 months ago

Is there a way for the consensus command to take into account deletions that start outside of the region of interest, analogous to the -r option for other commands? For example, if the sample from individual_ID has a deletion starting at 5219000 and ending at 5221000, the command below will not remove the deletion. I do not want the deletion in the consensus sequence.

samtools faidx reference.fa chr11:5220000-5227000 > reference_region.fa
cat reference_region.fa | bcftools consensus -H 1pIU -s individual_ID --mark-del - --mark-ins lc example.vcf.gz

pd3 commented 5 months ago

There is now a new option --regions-overlap which works similarly to other commands: overlapping variants starting outside of the target region of the fasta file can be ignored with --regions-overlap 0 or taken into account when set to 1 or 2.

Note that for the latter there will be ambiguous cases, see

Please test this out and let me know if you encounter anything odd.
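
To make the three settings concrete, here is a rough sketch of how they differ. It is a simplified model only, not the actual bcftools implementation. The `overlaps` function is hypothetical, and the deletion is assumed to be a VCF record with POS=5219000 and a 2001-base REF allele whose first base is the anchor (padding) base, so the record spans 5219000-5221000 as in the example from the original question. The mode names pos, record and variant are the aliases bcftools documents for 0, 1 and 2.

```shell
#!/bin/sh
# Simplified model of --regions-overlap 0|1|2 (an illustration, NOT bcftools' code).
# Usage: overlaps MODE POS REF_LEN REGION_START REGION_END
#   MODE    : 0|pos, 1|record, 2|variant
#   POS     : 1-based VCF POS of the record
#   REF_LEN : length of the REF allele
# Assumption: in mode 2 the record is a deletion whose first REF base is an
# anchor base, so the true sequence variation starts at POS+1.
overlaps() {
    mode=$1 pos=$2 ref_len=$3 beg=$4 end=$5
    rec_end=$((pos + ref_len - 1))   # last reference base covered by the record
    case $mode in
        0|pos)      # POS itself must lie inside the region
            [ "$pos" -ge "$beg" ] && [ "$pos" -le "$end" ] ;;
        1|record)   # any part of the record may touch the region
            [ "$pos" -le "$end" ] && [ "$rec_end" -ge "$beg" ] ;;
        2|variant)  # only the true variation (without the anchor base) counts
            [ $((pos + 1)) -le "$end" ] && [ "$rec_end" -ge "$beg" ] ;;
        *)
            echo "unknown mode: $mode" >&2; return 2 ;;
    esac
}

# Deletion from the question against the region chr11:5220000-5227000
for mode in 0 1 2; do
    if overlaps "$mode" 5219000 2001 5220000 5227000; then
        echo "mode $mode: deletion is taken into account"
    else
        echo "mode $mode: deletion is ignored"
    fi
done
```

Under this model the deletion is ignored with 0 and taken into account with 1 or 2. Modes 1 and 2 only differ at region edges, for example when just the anchor base of a record falls inside the region.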

nage0178 commented 5 months ago

Thank you for your response. The --regions-overlap option does not appear to work with the consensus command. The command

cat reference_region.fa | bcftools consensus -H 1pIU -s individual_ID --mark-del - --mark-ins lc --regions-overlap 1 example.vcf.gz

gives the following output

consensus: unrecognized option '--regions-overlap'


I confirmed I have the latest version of bcftools, and the --regions-overlap option works with the view command.

pd3 commented 5 months ago

This feature was added a few hours ago by You'll need to update to the latest GitHub version.