Open mj-jimenez opened 1 year ago
I honestly think this is not an issue but removing the prefiltering step is also unlikely to cause any harm. On the other hand, generating one dds object for each comparison should not be the default approach for any RNA-Seq analysis.
PD: The aforementioned design can be simplified to a single compound condition, unless users are interested in specific interaction terms.
In
deseq_init.R
, this line removes genes with less than 10 counts in all samples. This step has been copied from the deseq2 vignette and aims to reduce memory usage and speed up the computation.However, I was told that some users create different
deseq2
objects from the same count matrix in order to compare two groups of samples. For example, given this design matrix:Some users run two analyses to compare treatment vs control within each genotype (instead of modelling a complex design such as
~genotype + condition
). As some genes may have low count number within one of the genotypes, the normalized counts will not contain exactly the same genes.First, I would like to know how common this procedure is. Maybe some regular users can give feedback @ELENAPINEIRO @ralvarez-hub @jlanillos @lserranor @Maria-rfranklin.
As the authors state in the vignette, this is not an essential step. If the above procedure is standard, maybe we can just remove these lines. @SGMartin what do you think?