Dear people,
based on an issue noticed in a previous post concerning survival analysis, i inspected the outputs of normalization of raw HTSeq counts using TCGAbiolinks and the TCGAanalyze_Normalization function, using both geneLength and gcContent options, in order to examine the relative counts for a specific vector of genes, selected for downstream analysis. The relative steps are illustrated in the above code chunk, in which only filtering was not perform, in order not to loose any of the genes.
As you can see from an illustrative subset of the normalized counts, in the second approach with gcContent, for example CEBP shows a high number of counts, whereas in the first case with geneLength, has 0 counts (also this is the case for a bigger number of samples. Also, the same is relatively evident in the other shown genes. Thus, this would not be an issue or concern for any downstream analysis ? For example if filtering is performed after normalization ?
as some genes/genomic features could be lost ?
Dear people, based on an issue noticed in a previous post concerning survival analysis, i inspected the outputs of normalization of raw HTSeq counts using TCGAbiolinks and the TCGAanalyze_Normalization function, using both geneLength and gcContent options, in order to examine the relative counts for a specific vector of genes, selected for downstream analysis. The relative steps are illustrated in the above code chunk, in which only filtering was not perform, in order not to loose any of the genes.
As you can see from an illustrative subset of the normalized counts, in the second approach with gcContent, for example CEBP shows a high number of counts, whereas in the first case with geneLength, has 0 counts (also this is the case for a bigger number of samples. Also, the same is relatively evident in the other shown genes. Thus, this would not be an issue or concern for any downstream analysis ? For example if filtering is performed after normalization ? as some genes/genomic features could be lost ?
Or i misinterpret something here ?
Best,
Efstathios-Iason Vlachavas