Closed kopardev closed 10 months ago
I opened this up in an interactive RStudio session on biowulf using the dataset in /scratch/RENEE_testing_240103
. I see that only one gene passes the filterByExpr function, causing this error in voom. However, on a different run of the test data I get 6 genes passing the filter. So it seems for some reason RSEM is giving different expected counts on the same dataset.
6987849
this hash was from an unmerged branch which changed the multiqc container just after v2.5.8, so RSEM should have behaved the same as it did in v2.5.8.
rawcounts <- read.table(file = '/data/sovacoolkl/renee_test/DEG_ALL/RSEM.genes.expected_counts.all_samples.reformatted.tsv', sep = '\t', header = TRUE, row.names = 1, quote = "")
deg <- edgeR::DGEList(counts = rawcounts)
keep_genes <- edgeR::filterByExpr(deg)
keep_genes %>% Filter(function(x) isTRUE(x),.) %>% dput()
c(`ENSG00000185658.13|BRWD1` = TRUE, `ENSG00000157540.21|DYRK1A` = TRUE,
`ENSG00000160294.11|MCM3AP` = TRUE, `ENSG00000141959.17|PFKL` = TRUE,
`ENSG00000182670.13|TTC3` = TRUE, `ENSG00000160201.11|U2AF1` = TRUE
)
3c747aa
rawcounts <- read.table(file = '/data/sovacoolkl/RENEE_testing_240103/DEG_ALL/RSEM.genes.expected_counts.all_samples.reformatted.tsv', sep = '\t', header = TRUE, row.names = 1, quote = "")
deg <- edgeR::DGEList(counts = rawcounts)
keep_genes <- edgeR::filterByExpr(deg)
keep_genes %>% Filter(function(x) isTRUE(x),.) %>% dput()
c(`ENSMUSG00000106106.2|CT010467.1` = TRUE)
I used this to launch a test run with v2.5.10:
module load ccbrpipeliner
renee run \
--input /data/CCBR_Pipeliner/Pipelines/RENEE/develop/.tests/*.R1.fastq.gz \
--output /data/sovacoolkl/renee_test_v2.5.10 \
--genome hg38_30 \
--mode slurm \
--sif-cache /data/CCBR_Pipeliner/SIFS
@kopardev is launching another test run with --genome hg38_30
, he previously used the default (mouse) genome. This is almost certainly the cause of the issue, but let's wait to close it until Vishal's new test run succeeds.
@kopardev did your new test run succeed?
Vishal's test succeeded