mmpust / airway-metagenome-simulations


gapseq usage #1

Open arianccbasile opened 2 years ago

arianccbasile commented 2 years ago

Hello :) Very nice paper, congratulations. I have just one question: in the manuscript you stated that you used gapseq to "facilitate the taxonomic and functional identification of core and rare species from shotgun metagenomic sequencing data and reference genomes with omission rates". Could you explain what exactly you did? It is not completely clear to me.

Best, Arianna Basile

mmpust commented 2 years ago

Dear Arianna, thank you for your question, and I am sorry the sentence was not clear enough! We used the raspir tool to filter microbial taxa from our metagenomic patient samples after reference-based alignment. Raspir allowed us to retain low-abundance taxa, which are otherwise typically discarded. After this filtering step, we took the reference genomes of the remaining species and ran them through the gapseq pipeline, which let us investigate the functional and metabolic repertoire of these reference genomes as well. Does this answer your question? Best wishes, Marie

arianccbasile commented 2 years ago

Thank you Marie for your quick answer. Now it is clearer, thank you. Just to be sure, you used gapseq to find pathways and transporters but without running any simulation with the metabolic reconstructions obtained, right?

Best, Arianna

mmpust commented 2 years ago

Dear Arianna, yes, exactly. We performed neither metabolic reconstruction nor flux balance analysis. However, gapseq calculates a completeness score per pathway per reference genome, which is very powerful. We simply extracted this scoring information for the downstream data analysis. It would have been more powerful to assemble bacterial genomes directly from the patient samples and then do the metabolic modeling with those MAGs. With that approach, though, we would have captured only the high-abundance taxa, and in this publication we were particularly interested in the contribution of the low-abundance taxa. Best wishes, Marie
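For readers who want to reproduce the score-extraction step described above, here is a minimal sketch of how per-pathway completeness scores could be collected into a genome-by-pathway matrix. It is not the authors' code. It assumes each reference genome has one tab-separated pathway table with a header row, a pathway identifier column named `ID`, and a score column named `Completeness`, and that comment lines start with `#`. Check these names against your own gapseq output. The function names and the choice to score a missing pathway as 0.0 are hypothetical.

```python
import csv
import io


def read_completeness(table_text, id_col="ID", score_col="Completeness"):
    """Return {pathway_id: completeness} from one tab-separated pathway table.

    Column names are assumptions; adjust them to match the actual table header.
    """
    # Drop blank lines and comment lines (assumed to start with '#')
    # so that the first remaining line is the header row.
    lines = [
        ln for ln in table_text.splitlines()
        if ln.strip() and not ln.startswith("#")
    ]
    reader = csv.DictReader(io.StringIO("\n".join(lines)), delimiter="\t")
    return {row[id_col]: float(row[score_col]) for row in reader}


def completeness_matrix(tables):
    """Build a genome-by-pathway matrix from {genome_name: table_text}.

    Returns (pathway_ids, {genome_name: [score per pathway]}).
    A pathway absent from a genome's table is scored 0.0 (an assumption).
    """
    per_genome = {g: read_completeness(t) for g, t in tables.items()}
    pathways = sorted({p for scores in per_genome.values() for p in scores})
    matrix = {
        g: [scores.get(p, 0.0) for p in pathways]
        for g, scores in per_genome.items()
    }
    return pathways, matrix
```

In practice the table text would be read from one file per reference genome, for example with `pathlib.Path(path).read_text()`, and the resulting matrix passed on to whatever downstream analysis is used.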