Closed pryabinin closed 2 years ago
The problem was caused by the fact that the ID variable in colData was not the same as the row names of the sample annotation when the MultiAssayExperiment object was created. In order to fix this problem, two changes need to be made, first change the following code from:
colData(mae)$ID <- colData(mae)$bcr_patient_barcode
to
colData(mae)$ID <- colData(mae)$Sample
and then remove the following line entirely:
colData(mae)$ID <- stringr::str_replace_all(colData(mae)$ID,"-","_")
I am trying to reproduce the results from the publication: "netDx: interpretable patient classification using integrated patient similarity networks" (https://www.embopress.org/doi/full/10.15252/msb.20188497). Namely, the part of the paper that deals with classifying samples as Luminal A or Other using a pathway level features (it is under the section "Pathway-level feature selection identifies cellular processes predictive of clinical condition").
I am encountering an issue when I run buildPredictor() where the output's "predictions" slot of the split data is a data frame with zero rows. While buildPredictor completes successfully, running getResults() returns error that is caused by this issue:
The gene expression data is downloaded from here: https://gdc.cancer.gov/about-data/publications/brca_2012
Below is my code, here are direct links to the data: https://api.gdc.cancer.gov/data/3a4451c4-fd0d-4c1e-be49-4fb88bd1beaf https://api.gdc.cancer.gov/data/af7e82cd-acae-4474-b644-9e8e55acfaf1 http://download.baderlab.org/EM_Genesets/February_01_2018/Human/symbol/Human_AllPathways_February_01_2018_symbol.gmt
Create MultiAssayExperiment from data
Prepare data for netDx. Create sample group and sample ID variables
Prepare pathways to group genes
Define patient similarity measure. Only need one for gene expression
Build Predictor
Get results
Examine the buildPredictor() output
Also, I am able to run the vignette successfully.