benjjneb / dada2

Accurate sample inference from amplicon data with single nucleotide resolution
http://benjjneb.github.io/dada2/
GNU Lesser General Public License v3.0
468 stars 142 forks source link

trimOverhang with ITS data #1490

Closed andressamv closed 4 months ago

andressamv commented 2 years ago

Hi, this is my first time using Dada2 to work with ITS. I sequenced the ITS1-1F region using 2 x 250 bp, so I have a lot of variation in the amplicon size. I followed the ITS workflow so far (https://benjjneb.github.io/dada2/ITS_workflow.html), including the removal of primers using Cutadapt. However, it is not clear to me if I should use the trimOverhang = TRUE argument during the merge because of this variation. How would that affect my results?

benjjneb commented 2 years ago

trimOverhand=TRUE will only be relevant to your results if your sequenced amplicon is sometimes smaller than the read size, i.e. less than 250bp in your case. Do you know if the ITS1 region you are sequencing is sometimes less than 250bp?

That said, if uncertain, trimOverhang=TRUE is perhaps a safer choice. If the amplicon is >250 bps, this will have no effect, and if it is <250bp it will appropriately trim off the overhang on each side.

Andreas-Bio commented 2 years ago

@andressamv You don't need to worry about overhangs if you use ITSx software to cut everything except the ITS1 region.