We saw earlier that a problem with E/ASVs is the risk of splitting the operons from the same genome into multiple taxonomic groups. I would like to determine the threshold we should use so that 95% of the genomes are represented by a single E/ASV.
Create a line plot of the 95th percentile for each threshold at different number of operons per genome. This should be faceted by region within the 16S rRNA gene.
Create a line plot showing the 95th percentile line for number of E/ASVs per genome for each region when we only consider those genomes with 7 copies of the rrn operon
We saw earlier that a problem with E/ASVs is the risk of splitting the operons from the same genome into multiple taxonomic groups. I would like to determine the threshold we should use so that 95% of the genomes are represented by a single E/ASV.
Notes: