jtamames / SqueezeMeta

A complete pipeline for metagenomic analysis
GNU General Public License v3.0
371 stars 79 forks source link

Taxa Bar plot #531

Closed aalkin2001 closed 2 years ago

aalkin2001 commented 2 years ago

Taxa bar plot.pdf Hello, I ran SqueezeMeta script on microbial DNA enriched and unenriched samples and generated the bar plot for the percentage abundance at phylum and species levels. But I noticed from the bar plot generated that there are more unmapped reads in the microbial DNA enriched (HN65S1_mc) sample.

Here are my questions:

  1. I just want to know what the unmapped reads in the bar chart represent.
  2. Why do we have more unmapped reads in the enriched sample, technically, since there is more microbial DNA present in the enriched sample, we should have more reads mapped not the other way.

I am kind of confused with the result generated. Would you please shed more light on why the bar result is like this?

Thank you.

jtamames commented 2 years ago

Hello The unmapped reads are not unclassified taxa, but reads that were not assembled in the assembly step. These reads are therefore not represented in the analysis. You can check the 10.mappingstat files, and you should see that the percentage of reads not in the assembly matches that number. The reasons for this may be many, usually related to the sequencing depth and the diversity and richness of the microbiome. Did you sequence less for the enriched sample than for the other? Best, J

aalkin2001 commented 2 years ago

Hi,

Thank you for your response and clarifications. Yes, I did sequence less of the enriched sample.