vrmarcelino / CCMetagen

Microbiome classification pipeline
GNU General Public License v3.0
64 stars 19 forks source link

how to represent top most abundant species #26

Closed saras224 closed 3 years ago

saras224 commented 3 years ago

Hi I want to represent top 25 species from all the samples. I have a merged file which is a text file, has rows as the species and columns as the samples and values are the relative abundances. Now how can I select top 25 abundant species from all the samples?

Thanks Saraswati

vrmarcelino commented 3 years ago

Hi Saraswati!

The post-processing of taxonomic assignments is not covered within the functionality of CCMetagen, but there are many ways you can do this. R is my favourite. Have a look at PhyloSeq and other programs R modules. The CCMetagen output you have can be used as an "otu table" for PhyloSeq.

We also have a tutorial here: https://github.com/vrmarcelino/CCMetagen/tree/master/tutorial, it is a bit outdated as the CCMetagen output changed slightly since I wrote it (and PhyloSeq probably changed as well) - in this new version you will need to handle the rows with unclassified tax ranks (e.g. unknown species of genus X) in the way that best works for you. I hope the tutorial will help you to get started.

Vanessa

saras224 commented 3 years ago

Thank you so much for your response. It was really helpful. Saraswati Awasthi

On Fri, 11 Jun 2021 at 04:28, VR Marcelino @.***> wrote:

Hi Saraswati!

The post-processing of taxonomic assignments is not covered within the functionality of CCMetagen, but there are many ways you can do this. R is my favourite. Have a look at PhyloSeq and other programs R modules. The CCMetagen output you have can be used as an "otu table" for PhyloSeq.

We also have a tutorial here: https://github.com/vrmarcelino/CCMetagen/tree/master/tutorial, it is a bit outdated as the CCMetagen output changed slightly since I wrote it (and PhyloSeq probably changed as well) - in this new version you will need to handle the rows with unclassified tax ranks (e.g. unknown species of genus X) in the way that best works for you. I hope the tutorial will help you to get started.

Vanessa

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/vrmarcelino/CCMetagen/issues/26#issuecomment-859138693, or unsubscribe https://github.com/notifications/unsubscribe-auth/ASPIOTJSMJVGQANXOFQZIQDTSE7RDANCNFSM46OR4HKA .