benjjneb / dada2

Accurate sample inference from amplicon data with single nucleotide resolution
http://benjjneb.github.io/dada2/
GNU Lesser General Public License v3.0
460 stars 141 forks source link

Is denoising required for kraken analysis? #1952

Closed BrinthaVP closed 1 month ago

BrinthaVP commented 1 month ago
          > Would it be best to use the discarded reads for **this** or reads still present at the end of the Dada2 pipeline?

If "this" means the analysis I proposed above, ASVs refer to the output of DADA2. Kraken would be used on the raw reads.

Originally posted by @benjjneb in https://github.com/benjjneb/dada2/issues/1721#issuecomment-1515405061

Could you please point out to any relevant material on "Why denoising not required for kraken analysis"

benjjneb commented 1 month ago

I'm not sure what you are asking here?

BrinthaVP commented 1 month ago

Currently I assume that kraken can be ran on raw reads and does not require any denoising as the output of DADA2 is actually ASV's. Still it is possible that we can take the denoised sequences and ran Kraken on them.

I saw one of your comments also saying that Kraken is used on raw reads. It is only referring to that comment, I requested to point out to any relevant material on "Why denoising not required for kraken analysis?".

I am just wondering whether Kracken+Bracken performance be similar to Dada2+Qiimeclassifier.

benjjneb commented 1 month ago

I requested to point out to any relevant material on "Why denoising not required for kraken analysis?".

The paper proposing using Kraken for 16S profiling ran Kraken on reads, not ASVs. https://doi.org/10.1186/s40168-020-00900-2

I am just wondering whether Kracken+Bracken performance be similar to Dada2+Qiimeclassifier.

When I have tried them, metagenomic classifiers like Kraken produce very high numbers of false positive taxa from 16S data.

BrinthaVP commented 1 month ago

Thanks for your reply. I will look into it.