How to normalize CNA and RNA data before NMF? What is the normalization method in the example data

broadinstitute / PANOPLY

Repository for the Broad Institute Proteogenomic Data Analysis Center (PGDAC) established by the NIH Clinical Proteomics Tumor Analysis Consortium (CPTAC)

Other

33 stars 15 forks source link

The input data in the tutorial are as follows:

RNA data was quantified using RSEM4 and normalized within samples to a fixed upper quartile. The data were then log2 transformed. Further, each gene was median centered.
CNA data were obtained from the SNP chip, followed by normalization (division by 2) and log2 transformation.

The NMF clustering module takes data on a similar scale as proteomics data (ie log ratios to a feature-relative reference), and z-scores are calculated across samples (columns) before performing NMF clustering. See https://github.com/broadinstitute/PANOPLY/wiki/Data-Analysis-Modules%3A-panoply_mo_nmf.

broadinstitute / PANOPLY

How to normalize CNA and RNA data before NMF? What is the normalization method in the example data #29