Filtering facet.by plots by significant pvalues

kassambara / ggpubr

'ggplot2' Based Publication Ready Plots

1.13k stars 165 forks source link

p <- ggboxplot(as.data.frame(exp), x = "oncogene", y = "value", facet.by = "external_gene_name", color = "oncogene", add = "jitter") + stat_compare_means(comparisons = my_comparisons, method.args = list(alternative = "greater"))

Hi,

I would suggest the following procedure

Perform a differential expression analysis between group to keep only significant genes (using limma)
Visualize some of key genes differentially expressed (using ggpubr)

If you want to do the filtering process in ggpubr, you can go as follow.

Load packages:

library(tidyverse)
library(ggpubr)

Prepare some data:

# Prepare some data
df <- iris %>%
  as_tibble() %>%
  gather(key = "gene", value = "expression", -Species) %>%
  rename(group = Species)
df

# A tibble: 600 x 3
   group  gene         expression
                  
 1 setosa Sepal.Length        5.1
 2 setosa Sepal.Length        4.9
 3 setosa Sepal.Length        4.7
 4 setosa Sepal.Length        4.6
 5 setosa Sepal.Length        5  
 6 setosa Sepal.Length        5.4
 7 setosa Sepal.Length        4.6
 8 setosa Sepal.Length        5  
 9 setosa Sepal.Length        4.4
10 setosa Sepal.Length        4.9
# ... with 590 more rows

Perform Anova to filter out not significant genes (Anova adjusted p-value > 0.05)

res.stats <- compare_means(expression ~ group, group.by = "gene", data = df, method = "anova") %>%
  filter(p.adj > 0.05)

Visualize some of significant genes

kassambara / ggpubr

Filtering facet.by plots by significant pvalues #122