ggloor / ALDEx_bioc

ALDEx_bioc is the working directory for updating bioconductor
26 stars 13 forks source link

Question regarding the significance of the results #56

Closed vdijckthomas closed 1 year ago

vdijckthomas commented 1 year ago

Dear all,

I recently performed an analysis on my dataset:

I used the the aldex() function with two groups: aldex2_outp <- ALDEx2::aldex(otu.tab_transp, phyloseq::sample_data(ps_meta)$Vegetation, mc.samples=128, test="t", effect = TRUE, denom="all", verbose=FALSE, paired.test=FALSE)

I have some difficulties with interpreting the results. I have attached the obtained plots down below. plot_ALDEx2

The plots show that 51 ASVs are statistically significant (red dots). To the best of my knowledge, it is advised to look at the effect size of the dots. As can be seen in the effect plot (rightside) no ASV has an effect size larger than 1 (ratio between group variation/within group variation). The within-group variation is larger than the between-group variation.

Can I conclude that no ASV is differentially expressed between both groups, and that the ASVs that are annotated as statistically significant are a consequence of the within-group variation of these ASVs?

Many thanks for your help,

Kind regards.

Thomas

ggloor commented 1 year ago

Just looking at the data there is a lot of variance for sure. It looks to me like a lot of sparsity but there ar 4 ASVs that are relatively abundant. Weirdly, on the effect plot, they still have very large dispersion. Are they found in all biodiverse samples or only a subset?

It would be useful to perform a compositional biplot to observe if the biodiverse and controls separate

vdijckthomas commented 1 year ago

Thank you for your fast reply!

Would you be so kind as to explain how I can find these 4 ASVs in my plot so I can have a look at the samples they are found in?

Regarding the compositional biplot; Initially, I performed an NMDS [dissimilarity matrix: mean values of repeated rarefaction using avgdist()]. Group (Sedum vs biodiverse) dispersions were homegeneous and the permutational analysis of variance [adonis()] showed that both groups differed in composition, although only a slight fraction of the variation could be explained by this (Rsquared = 0.053). I just made a compositional biplot, indicating the same result? ps_meta %>% tax_transform("clr") %>% ord_calc() %>% ord_plot(color = "Vegetation", size = 2) + scale_colour_brewer(palette = "Paired")

PCA_biplot Or grouped per habitat: PCA_per_habitat

Thank you!

vdijckthomas commented 1 year ago

Any insights?

ggloor commented 1 year ago

x.e <- aldex.effect(x)

rownames(x.e)[which(x.e$rab.all> 5 & x.e$diff.btw < -4)]

On May 15, 2023, at 11:05 AM, vdijckthomas @.***> wrote:

Any insights? — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you commented.Message ID: @.***>

vdijckthomas commented 1 year ago

Thank you for your fast reply. These values (rab.all> 5 and diff.btw < -4) are arbitrary? Running your script resulted in 3 ASVs. The number of samples (%) they are found in, per condition, is summarize below:

  | ASV1 | ASV2 | ASV3 Biodiverse | 69,6 | 83,3 | 77,5 Sedum | 28,8 | 31,5 | 20,5

ggloor commented 1 year ago

this was looking at your MA plat with the four clustered ASVs on the bottom at relatively high abundance.

I'm sorry, but I cannot help you with you analysis beyond interpretation of the plots.

Greg

On May 15, 2023, at 12:18 PM, vdijckthomas @.***> wrote:

Thank you for your fast reply. These values (rab.all> 5 and diff.btw < -4) are arbitrary? Running your script resulted in 3 ASVs. The number of samples (%) they are found in, per condition, is summarize below: | ASV1 | ASV2 | ASV3 Biodiverse | 69,6 | 83,3 | 77,5 Sedum | 28,8 | 31,5 | 20,5 — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you commented.Message ID: @.***>

vdijckthomas commented 1 year ago

No problem, thank you for your effort.