benjjneb / dada2

Accurate sample inference from amplicon data with single nucleotide resolution
http://benjjneb.github.io/dada2/
GNU Lesser General Public License v3.0
468 stars 142 forks source link

learnErrors DADA2 #1897

Closed rosauraalfaro closed 4 months ago

rosauraalfaro commented 7 months ago

I'm working with 16S amplicon sequences from rhizosphere soil and to process my data I'm using the DADA2 version 1.30.0 pipeline in R.

When calculating the error rate of my reverse sequences with the learnErrors function, which is a step before calculating the ASVs, I get the error message on the image.

Error

With my forward sequences I didn't have this problem and I have already been able to calculate the error rates.

These images are of the quality of my reverse sequences.

000010 000012

benjjneb commented 6 months ago

Can you show the output of plotErrors for your forward read error model?

And can you also provide a plotQualityProfile for the forward reads, so we can compare that to what you showed above for the reverse reads that aren't working.

ps: Also, what is the specific sequencing tech you are using? And what is your filterAndTrim command?

rosauraalfaro commented 6 months ago

Hi! thanks for you reply, I'll answer to your questions.

Can you show the output of plotErrors for your forward read error model?

plotError_fwd

And can you also provide a plotQualityProfile for the forward reads, so we can compare that to what you showed above for the reverse reads that aren't working.

plotQualityProfile_FWD plotQualityProfile_FWD2

ps: Also, what is the specific sequencing tech you are using? And what is your filterAndTrim command? The sequencing tech that we used is Illumina. The "filterandTrim" command was: out <- filterAndTrim(fnFs, filtFs, fnRs, filtRs, truncLen=c(240,200), maxEE=c(2,2), truncQ=2, rm.phix=TRUE, compress=TRUE, multithread=TRUE, maxN = c(0,0))

benjjneb commented 6 months ago

It's not clear to me why learnErrors would be working on your forward reads but not reverse.

What amplicon are you sequencing? What is the primer set? Is the primer being sequenced?