Is dada2 suitable for Nextera XT + viral genome?

benjjneb / dada2

Accurate sample inference from amplicon data with single nucleotide resolution

GNU Lesser General Public License v3.0

Hi,

I have been wrangling some metabarcoding MiSeq data using dada2 and OTU clustering to help simplify the search for recombinant sequences. I have managed to run through my makeshift workflow, but I am wondering whether dada2 was suitable for this work in the first place.

I'll explain my experimental setup and then how I used dada2.

A cell culture is infected with a virus and an element that should result in recombinant viral genomes in some cases. This heterogeneous culture is extracted and 2-3 amplicons (2-4 kb each) are generated. These amplicons are indexed using Nextera XT (so they are fragmented) and then sequenced on MiSeq (2 x 150), so the resulting reads are at most 150 bp.

My concerns are as follows. Since my sample is very mixed (multiple amplicon loci, different tagmentation fragments, possible recombinants), does that interfere with learnErrors?
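
To make the tagmentation part of that concern concrete, here is a toy simulation in plain Python (not dada2, and not my real data). It assumes a single error-free 3 kb template, fragment lengths of 200-600 bp, and uniformly random fragment positions; all of these numbers and the function name are made up for illustration. Because every fragment starts somewhere different, even one template produces thousands of distinct 150 bp read sequences, each seen only a handful of times.

```python
import random
from collections import Counter

def simulate_forward_reads(amplicon, n_fragments, read_len=150,
                           min_frag=200, max_frag=600, seed=0):
    """Return forward reads from randomly placed fragments of one template.

    Hypothetical model: fragment length and start position are both drawn
    uniformly at random, and no sequencing errors are introduced.
    """
    rng = random.Random(seed)
    reads = []
    for _ in range(n_fragments):
        frag_len = rng.randint(min_frag, max_frag)
        start = rng.randint(0, len(amplicon) - frag_len)
        # The forward read covers the first read_len bases of the fragment.
        reads.append(amplicon[start:start + read_len])
    return reads

# One random 3 kb "amplicon" standing in for a viral locus.
template_rng = random.Random(1)
amplicon = "".join(template_rng.choice("ACGT") for _ in range(3000))

reads = simulate_forward_reads(amplicon, n_fragments=10000)
counts = Counter(reads)

print(len(counts), "distinct read sequences from", len(reads), "error-free reads")
print("largest abundance of any single sequence:", counts.most_common(1)[0][1])
```

If I understand the method correctly, this is very different from ordinary amplicon data, where reads from the same template begin at the same primer and collapse into a few abundant unique sequences.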

Since I'm looking for recombinants, I found a way to extract the list of chimeras that would normally be filtered out. I can then scan this subset for genuine mutants.
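
To be clear about what "chimera" means here, the sketch below is a deliberately simplified check in plain Python: a sequence is flagged when it is exactly the left part of one parent joined to the right part of a different parent. It is not dada2's implementation (it ignores abundances, mismatches, and indels), and the function names and toy sequences are hypothetical.

```python
def shared_prefix_len(a, b):
    """Length of the longest common prefix of a and b."""
    k = 0
    while k < min(len(a), len(b)) and a[k] == b[k]:
        k += 1
    return k

def is_exact_bimera(seq, parents):
    """True if seq equals a left piece of one parent plus a right piece of another.

    Simplifying assumptions: parents have the same length as seq,
    matching is exact, and seq itself is never used as a parent.
    """
    candidates = sorted({p for p in parents if len(p) == len(seq) and p != seq})
    for left in candidates:
        for right in candidates:
            if left == right:
                continue
            left_match = shared_prefix_len(seq, left)
            # Compare reversed strings to measure the shared suffix.
            right_match = shared_prefix_len(seq[::-1], right[::-1])
            if left_match + right_match >= len(seq):
                return True
    return False

wild_type = "AAAAAAAAAA"      # toy stand-in for the viral sequence
element = "CCCCCCCCCC"        # toy stand-in for the introduced element
recombinant = "AAAAACCCCC"    # left half virus, right half element
point_mutant = "AAAAGAAAAA"   # single substitution, not a junction

print(is_exact_bimera(recombinant, [wild_type, element]))
print(is_exact_bimera(point_mutant, [wild_type, element]))
```

Under a check like this, a true recombinant and a PCR chimera look identical, which is why the flagged set seemed like the right place to search.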

Does this make sense? Or am I way off?

Is dada2 suitable for Nextera XT + viral genome ? #1529