Applying FDR-adjusted test on UniFrac Distances

joey711 / phyloseq

phyloseq is a set of classes, wrappers, and tools (in R) to make it easier to import, store, and analyze phylogenetic sequencing data; and to reproducibly share that data and analysis with others. See the phyloseq front page:

http://joey711.github.io/phyloseq/

579 stars 188 forks source link

Applying FDR-adjusted test on UniFrac Distances #593

Closed AndreaQ7 closed 8 years ago

AndreaQ7 commented 8 years ago

Hi everyone,

I'm trying to apply FDR-adjusted test on UniFrac distances...does anyone know how to perform it? I imagine that at first I should perfom distance(data,"wunifrac") but then? Have a nice day you all

joey711 commented 8 years ago

Hi @AndreaQ7

Your question is a bit under-specified. To comment on multiple testing I would need to know what test you have in mind. For instance, if you were going to do an adonis test on a model with multiple sample-variables, multiple hypotheses are already accounted for in the F-test derived p-values. However, if you did multiple adonis tests, one at a time (not recommended), then you would need a correction.

Was this the test you had in mind with w-unifrac, or was it something else?

AndreaQ7 commented 8 years ago

Hi Joey, Thank you for your reply. Yes you right I've been bit superficial in my description sorry. I'm performing metagenomic analysis of sample from 5 different groups. I create a distance matrix with Weighted and Unweighted Unifrac, then I make the plot_ordination. But I would like to assign a statistical significance to the distance among the groups like : Treated vs No_Treated; Treated vs Treated etc.... Do you know how can I perform a statistical analyis to assign a significance of distances between pairs of groups? At the beginning I was thinking to performe a mt and than a correction for mt but I think that's not the right test to perform in this case. Thank you

michberr commented 8 years ago

As Joey alluded to, it sounds like you are looking for a PERMANOVA test (which can be implemented in R with the vegan adonis command). There's lots of documentation for this function online.

I would recommend reading this page, as well as many of the other pages on the gustame site for an introduction to multivariate statistics applied to microbiome datasets.

joey711 commented 8 years ago

Thanks @michberr ,

it also sounds worth pointing out that there is a big difference between testing for difference between

(1) whole microbial communities, or some fingerprint thereof, and (2) differences in relative abundance of individual taxa, or some transformation thereof

These are very different types of biological inference, and are computed with very different statistical methods.

I will close for now.

Make sure to also take a look at the new phyloseq FAQ.

AndreaQ7 commented 8 years ago

Thank you a lot guys for your information and suggestions