I think somewhat better documentation would help users know exactly what the datasets contain. Another example is mRNAArray, which is described as "Unified gene-level mRNA expression values". Checking that data type for BRCA, I see expression data normalized to 1 standard deviation. Is that what "unified" refers to? Are these data log2-transformed?
Hi John, @bblodfon Thanks for reporting. Good catch, we are looking at a way to provide the log2 RPM miRNA values through the pipeline. Best, Marcel
Hi @LiNk-NY,
I also checked the RNASeq2GeneNorm data, which the documentation describes as "Upper quartile normalized RSEM TPM gene expression values", but they are count data as well. Could you also have a look at that?
Thanks, we are in the upload stage of the process. We will have the data available shortly via curatedTCGAData in devel. I will update when that change is ready.
This should be resolved in data version 2.1.0 or higher (package version 1.23.5).
Hi,
In the paper and in the documentation, the miRNA data format is referred to as log2 RPM miRNA expression values. I looked at some of the data (see the code below) and it seems to be some form of counts (so not log2)?
Created on 2023-03-24 with reprex v2.0.2
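For readers who want to run the same kind of check on an assay, here is a minimal sketch of a heuristic that tells count-like values from log2-scale values. The helper name `describe_scale` and the toy matrices are hypothetical and are not taken from this thread or from curatedTCGAData. The rule of thumb is an assumption: raw counts are non-negative whole numbers with a wide range, while log2-scale values are mostly fractional and small.

```r
# Hypothetical helper: summarize properties that hint at the scale of a matrix.
describe_scale <- function(m) {
  v <- as.numeric(m)
  v <- v[is.finite(v)]            # drop NA, NaN and infinite entries
  list(
    range        = range(v),
    any_negative = any(v < 0),    # raw counts are never negative
    all_whole    = all(v == round(v)),  # raw counts are whole numbers
    max_over_100 = max(v) > 100   # log2-scale values rarely exceed ~30
  )
}

# Toy data (made-up numbers, not TCGA data).
counts   <- matrix(c(0, 12, 530, 48211, 7, 0), nrow = 2)
log2_rpm <- log2(counts / sum(counts) * 1e6 + 1)  # log2 of reads per million, +1 pseudocount

str(describe_scale(counts))
str(describe_scale(log2_rpm))
```

On a real dataset, the same function could be applied to the matrix extracted from the downloaded object, for example with `assay()` from SummarizedExperiment, assuming the experiment of interest is a SummarizedExperiment. The heuristic only suggests a scale; normalized but untransformed values such as RSEM expected counts are fractional and can still be large.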