SPEAQeasy: portable LIBD RNA-seq pipeline using Nextflow. Check http://research.libd.org/SPEAQeasy-example/ for an example on how to use this pipeline and analyze the resulting output files.
It might be useful to document at http://research.libd.org/SPEAQeasy/ where the different pieces for the RSE objects come from. So that might look like a subsection of http://research.libd.org/SPEAQeasy/outputs.html#main-outputs where we explain where each of the pieces (that don't overlap with say the quality metrics already explained in that section) of the RSE objects come from. That is, the rowRanges() and colData() variables. These do change by feature type (gene and exon are the most similar). I think that this will help new users, particularly those learning about SummarizedExperiment as well. It'll make it clearer which parts are common to RSE objects and which are parts we compute as part of SPEAQeasy.
Hi,
I initially got confused myself looking for how
geneMap$Length
is generated. I don't see it at https://github.com/LieberInstitute/SPEAQeasy/blob/master/scripts/build_annotation_objects.R yet it's used by https://github.com/LieberInstitute/SPEAQeasy/blob/master/scripts/create_count_objects-human.R#L529. I later realized that it comes fromfeatureCounts
, specifically from http://bioinf.wehi.edu.au/subread-package/SubreadUsersGuide.pdf.It might be useful to document at http://research.libd.org/SPEAQeasy/ where the different pieces for the RSE objects come from. So that might look like a subsection of http://research.libd.org/SPEAQeasy/outputs.html#main-outputs where we explain where each of the pieces (that don't overlap with say the quality metrics already explained in that section) of the RSE objects come from. That is, the rowRanges() and colData() variables. These do change by feature type (gene and exon are the most similar). I think that this will help new users, particularly those learning about
SummarizedExperiment
as well. It'll make it clearer which parts are common to RSE objects and which are parts we compute as part ofSPEAQeasy
.Best, Leo
R
Here's some R code where derived from https://github.com/LieberInstitute/SPEAQeasy/blob/master/scripts/build_annotation_objects.R where I was trying to see if
Length
showed up. These files are similar to https://github.com/LieberInstitute/RNAseq-pipeline/blob/master/Annotation/junction_txdb/feature_to_tx-gencode.R although now I realize that the older scripts havegeneMap
refer tofeatureCounts
files while the newer scripts have different sources ofgeneMap
(one is from Gencode GTFs another is fromfeatureCounts
).Gencode 37
Gencode 25
R 3.6.3
Gencode 37
Gencode 25
R 4.1 (R-devel)
This was the first thing I tried actually.