cbg-ethz / V-pipe

V-pipe is a pipeline designed for analysing NGS data of short viral genomes
https://cbg-ethz.github.io/V-pipe/
Apache License 2.0
129 stars 43 forks source link

Warning: All reads at position 4045 in the same reverse orientation ? #126

Open Vinoy-UKHSA opened 2 years ago

Vinoy-UKHSA commented 2 years ago

Hello V-pipe team, Thanks for the wonderful tool. My v-pipe works fine, however I am getting this warning (Warning: All reads at position 4045 in the same reverse orientation ?) for number of position around 50, and don't know what is wrong in the dataset. Can you please explain me why I get this warning and how to rectify this ? Thanks Vinoy

DrYak commented 2 years ago

Hello!

If I an not wrong, this is mostly due to ShoRAH not (yet) being designed for multiplex PCR protocols, like the typical ARTIC v3/v4 that we use for wastewater.

ShoRAH assumes randomly distributed reads (e.g.: like in some fragmented protocols, that sequence 50~70bp, used for clinical swabs), so it positions its windows at regular intervals. That's not the case with the typical ~250bp pair ended sequencing that we use with SARS-CoV-2 wastewater, there all reads start at the extremities of the multiplex PCR amplicons, and end also all roughly at similar places.

This has two consequences:

We have PhD and Master students working on improving ShoRAH (making it multiplex PCR amplicon aware, and correctly matching the windows to the boundaries of the amplicons - this will completely negate the problems mentionned above) which will be released in the comming weeks.