RWilton / Arioc

Arioc: GPU-accelerated DNA short-read alignment
BSD 3-Clause "New" or "Revised" License
55 stars 8 forks source link

tuning alignment soft-clipping for variant recall #11

Closed mdkeehan closed 3 years ago

mdkeehan commented 3 years ago

arioc-softclipping4 I have noticed that Arioc produces slightly more soft clips in the alignments than an alternative (bwa mem). The IGV screenshot shows an example from the Arioc test data. BWA alignment in the centre: two reads carry the alternative alignment A. Arioc at the bottom soft clips resulting in a single A and a soft clip. My untuned simple test bcftools variant calling pipeline then produces an A variant call from BWA but not from Arioc.

Overall I have noticed that Arioc alignments result in precise variant calls (in very high agreement to BWA alignment variant calls) but that BWA alignments will generate more lower quality snps in a subsequent variant calling stage. This is likely caused by the default settings for bcftools variant calling being compatible with BWA. I attribute many of these to the MAPQ differences. However my users would be concerned by the lower recall.

  1. The Arioc papers mention that the non gapped aligner will produce soft clips under certain conditions. Is there a user controllable parameter to tune soft clipping?

  2. Suggestions on using arioc as a drop in replacement aligner for a variant calling pipeline in either a high-precision or a high-recall variant calling setting might be helpful to users.

RWilton commented 3 years ago

Can you please verify that you are using both Arioc and BWA with the same alignment scoring parameters? Your screen snap shows an AS (alignment score) of 196 for the mapping reported by Arioc versus 98 for BWA's mapping. Unless you use the same scoring parameters, it is hard to make any head-to-head comparison of specific read mappings by two different aligners.

I am very much interested in the differences between Arioc and BWA in terms of variant calling, in particular because Arioc's model for MAPQ computation differs from that of BWA-MEM. But that's somewhat tangential to the question of soft clipping -- could we continue that discussion offline?

mdkeehan commented 3 years ago

Hi Richard,

Happy to continue discussing the differences in the variant calling offline.

Mike

On Fri, Dec 18, 2020 at 5:07 AM RWilton notifications@github.com wrote:

Can you please verify that you are using both Arioc and BWA with the same alignment scoring parameters? Your screen snap shows an AS (alignment score) of 196 for the mapping reported by Arioc versus 98 for BWA's mapping. Unless you use the same scoring parameters, it is hard to make any head-to-head comparison of specific read mappings by two different aligners.

I am very much interested in the differences between Arioc and BWA in terms of variant calling, in particular because Arioc's model for MAPQ computation differs from that of BWA-MEM. But that's somewhat tangential to the question of soft clipping -- could we continue that discussion offline?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/RWilton/Arioc/issues/11#issuecomment-747536408, or unsubscribe https://github.com/notifications/unsubscribe-auth/AFAOG5PJFHORMLN2QWEJS7DSVIUELANCNFSM4UZ22S2A .