ucsd-ccbb / C-VIEW

This software implements a high-throughput data processing pipeline to identify and charaterize SARS-CoV-2 variant sequences in specimens from COVID-19 positive hosts or environments.
MIT License
9 stars 2 forks source link

[URGENT] Change samtools depth command #141

Closed niemasd closed 2 years ago

niemasd commented 2 years ago

Currently, we run samtools depth like this:

samtools depth -d 0 -Q 0 -q 0 -aa TRIMMED_SORTED.BAM > DEPTH.TXT

We got this from the Andersen lab pipeline:

However, I was tinkering with ViReflow, and I noticed that this samtools depth command is missing a critical argument: -J (Include reads with deletions in depth computation). I think this might be the culprit behind the "bcftools consensus bug" Kristian had told us about in the past: by omitting -J, we are accidentally undercounting the coverage at sites in which reads have deletions. In my experimentation, simply adding -J completely fixes this. Thus, I propose we (urgently) change our samtools depth command to the following:

samtools depth -J -d 0 -Q 0 -q 0 -aa TRIMMED_SORTED.BAM > DEPTH.TXT