bioinformatics-centre / kaiju

Fast taxonomic classification of metagenomic sequencing reads using a protein reference database
http://kaiju.binf.ku.dk
GNU General Public License v3.0
259 stars 68 forks source link

Kaiju Phylogenetic Tree to include in a Phyloseq object #215

Closed magibc closed 2 years ago

magibc commented 2 years ago

Hi!

In my first metagenomic study, I have used Kaiju v.1.8.2 for taxonomic classification and using Progenomes database.

Nevertheless, I would like to know if is there any availble kaiju progenomes phylogenetic tree? Or any idea/hints of how can I construct a phylogenetic tree to include it in a phyloseq object for downstream analysis?

Thanks a lot,

Magi.

pmenzel commented 2 years ago

Creating a phylogenetic tree from a dataset is a different concept than what kaiju is doing. Here, the tree is already given in the form of a cladogram (i.e. the NCBI taxonomy) into which the query sequences are placed.

magibc commented 2 years ago

Thank you @pmenzel for your quick response.

Then from the taxonomical classification obtained from Kaiju, it is not possible to build a phylogenetic tree for downstream analyses? The cladogram that you comments could be downloaded from the Kaiju pipelines?

I know that Kaiju software tool purpose is not phylogenetic tree building but only I would like to know if you can give some hints/recommendations of which steps can I perform from Kaiju's output to build a phylogenetic tree? that I think that this type of question would also help fledging bioinformaticians as me starting in metagenomics field.

Thanks again,

Magi.

pmenzel commented 2 years ago

I don't know what kind of phylogenetic tree you would expect and what you want to do with it. The cladogram is implicit in the structure of the NCBI taxomomy, but there is nothing to download here.

Normally, people use the ouput of metagenomic classifiers to summarize the relative abundance of, for example, species, genera or higher taxonomic ranks within a sample, by using the number of reads assigned to a taxon as a proxy for its abundance in the sample.

magibc commented 2 years ago

Yes @pmenzel , I agree with you.

I try to explain me better. Sorry.

My purpose is to construct a phylogenetic tree (eg.Newick tree format) to include it in a phyloseq object in phyloseq R package. I am interested to create a phylogenetic tree to be able to calculate Unifrac distances (as Beta-diversity metrics) which needs phylogenies information.

Thanks

pmenzel commented 2 years ago

Yeah, I got that, but as I said: I wouldn't know how to make such a tree from shotgun metagenomic data.

magibc commented 2 years ago

Ok thank you Peter for your feedback. I appreciate it !

magibc commented 2 years ago

If someone could helps, I shared my way on how to construct a phylogenetic tree from Kaiju's using proGenomes as database: https://bioinformatics.stackexchange.com/questions/19018/phylogenetic-tree-building-from-progenomes-database-for-shotgun-metagenomics/19124#19124