assigntaxonomy confidence?

benjjneb / dada2

Accurate sample inference from amplicon data with single nucleotide resolution

GNU Lesser General Public License v3.0

469 stars 142 forks source link

Hey all,

If this has already been discussed, feel free to point me in that direction! I looked through and couldn't find anything but it's possible I missed it!

In discussing species/genus level assignment with a colleague, we started trying to figure out what the minimum sequence length from 16s you would need to be confident in the assignment that was made (i.e Kingdom, phylum, etc...). For example, if you have poor quality reads, only use the forwards, and have to trim 16s V3/V4 down to 125 nt, could you still be confident in genus level assignment? We've been analyzing some older data in prep for a dada2 run-through, and found some data that matched this description, but don't know whether we can trust it or not. And at what point can we trust it? Is there some guidance as to where these cutoffs should be made?

benjjneb / dada2

assigntaxonomy confidence? #976