Abundance Matrix Preprocessing

DavisLaboratory / singscore

An R/Bioconductor package that implements a single-sample molecular phenotyping approach

40 stars 5 forks source link

Hi @DarioS,

In such a scenario, you would generally use the protein coding genes only as those are the only ones represented by the gene-sets you are testing against. Another approach we tend to use is to filter out genes with low expression (rather than those that code for proteins) as this gives a better idea of relative expression (against the entire transcriptome). You are right in that we do not include this in the vignette and this is mainly because we have this information elsewhere (we did not want to duplicate information). The detailed discussion on this matter, along with many others that you would face while using singscore are in the workflow paper we published in F1000Research (https://f1000research.com/articles/8-776). Since this is workflow covers all processing steps, it is much more detailed and allowed us to discuss the implications of each decision point in the analysis. Feel free to ask us for further help on matters not discussed in the workflow.

Cheers, Dharmesh

DavisLaboratory / singscore

Abundance Matrix Preprocessing #24