CRI-iAtlas / ImmuneSubtypeClassifier

An R package for classification of immune subtypes, in cancer, using gene expression data.
Other
40 stars 23 forks source link

TCGA subtype problem #18

Open eacho111 opened 8 months ago

eacho111 commented 8 months ago

Hi, it is very useful package! I download fpkm gene expression data from TCGA by TCGAbiolinks package, and i try to get the immune subtypes of the LUAD samples by using this package.when i compare this subtypes result with the subtypes of this paper(five_signature_mclust_ensemble_results.tsv.gz),their subtypes are different. But when i use the gene expression of paper(ebppSubset.tsv.bz2) to classify these patients,the subtype result can match with the paper result(five_signature_mclust_ensemble_results.tsv.gz).So i want to know if there is any problem to classify immune subtype by using fpkm gene expression data from TCGA by TCGAbiolinks.if i need to do any thing with the gene expression download from TCGA?

Thanks! here are some codes and results! code and result .pptx

Gibbsdavidl commented 8 months ago

Hey there!

Sorry I was traveling.

So I'm not sure, but something I've seen happen is that data is occasionally already normalized; where each gene is normalized across samples. That completely breaks the classification. It's important to make sure the data is unnormalized across samples. Otherwise, fpkm is fine as all other summarizations (tpm, rpkm, etc.)

Hopefully that helps! -dave