Zymo-Research / figaro

An efficient and objective tool for optimizing microbiome rRNA gene trimming parameters
GNU General Public License v3.0
80 stars 25 forks source link

when to apply trimming parameters #52

Open smzt opened 1 year ago

smzt commented 1 year ago

Hi, I've been going through the documentation and the paper but I have not been able to find a clear answer yet to my naive question. When should I apply the values Figaro provides? Should I apply them before trimming the primers or after trimming the primers?

Thanks in advance.

Regards,

Sheila

alegione commented 11 months ago

My understanding if helpful: The tool is written for Dada2, so the output results should be used as truncation lengths in dada2. The tool truncates reads first, then removes the primers if you ask it to also trim the ends.

Example if using dada2 in R (all caps text to be replaced with your data)

fastqPairedFilter(fn = c(FORWARD_IN, REVERSE_IN), fout = c(FORWARD_OUT, REVERSE_OUT), truncLen = c(FIGARO TRIM POSITION 1, FIGARO TRIM POSITION 2), verbose = TRUE, maxN = 0, maxEE = 2, truncQ = 2, trimLeft = c(FORWARD PRIMER LENGTH, REVERSE PRIMER LENGTH), compress = TRUE)

Example if using dada2 in QIIME2

qiime dada2 denoise-paired \
    --i-demultiplexed-seqs demux-paired.qza \
    --p-trunc-len-f FIGARO TRIM POSITION 1 \
    --p-trunc-len-r FIGARO TRIM POSITION 2 \
    --p-trim-left-f FORWARD PRIMER LENGTH \
    --p-trim-left-r REVERSE PRIMER LENGTH \
    --o-representative-sequences representative-sequences.qza \
    --o-table table.qza \
    --o-denoising-stats denoising-stats.qza