run_viper input - Githubissues

Dear team,

Thank you for this excellent package.

I am trying to use run_viper function with the dorothea regulons on two different datasets and look for overlap between the two datasets.

For the gene_expression input to run_viper, is this expecting normalised counts? I was planning to use the [http://bioconductor.org/packages/devel/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#variance-stabilizing-transformation](variance stabilised values from DESeq2) for this - is this correct?

Because I am interested in the enrichment similarities between two datasets would you advise that the input should contain the gene counts for all samples from both datasets together like:

tf_activities <- run_viper(assay(vsd_datasets1and2), regulons, options =  list(method = "scale", minsize = 25, eset.filter = FALSE, cores = 4))

Or should I perform run_viper separately for both datasets? and then merge output matrices i.e.:

tf_activities.dataset1 <- run_viper(assay(vsd_dataset1), regulons,options =  list(method = "scale", minsize = 25, eset.filter = FALSE, cores = 4))
tf_activities.dataset2 <- run_viper(assay(vsd_dataset2), regulons,options =  list(method = "scale", minsize = 25, eset.filter = FALSE, cores = 4))
tf_activities..merged <- as_tibble(tf_activities.dataset1, rownames = "TF") %>% full_join(as_tibble(tf_activities.dataset2, rownames = "TF"), by = "TF")

Presumably if together is better then i should run DESeq2 vst function on all samples together too?

Many thanks for your help! Oliver

saezlab / dorothea

run_viper input #31