greenelab / deep-review

A collaboratively written review paper on deep learning, genomics, and precision medicine
https://greenelab.github.io/deep-review/
Other
1.25k stars 270 forks source link

DeepCancer: Detecting Cancer through Gene Expressions via Deep Generative Learning #157

Open sw1 opened 7 years ago

sw1 commented 7 years ago

Transcriptional profiling on microarrays to obtain gene expressions has been used to facilitate cancer diagnosis. We propose a deep generative machine learning architecture (called DeepCancer) that learn features from unlabeled microarray data. These models have been used in conjunction with conventional classifiers that perform classification of the tissue samples as either being cancerous or non-cancerous. The proposed model has been tested on two different clinical datasets. The evaluation demonstrates that DeepCancer model achieves a very high precision score, while significantly controlling the false positive and false negative scores.

https://arxiv.org/abs/1612.03211

They use a GAN and compare it to an RBM.

gwaybio commented 7 years ago

Computational Methods

Generative Adversarial Network

(183 x 224 = 40,992 probes) - they did not discuss how the probes were ordered.

It was interesting that their motivation for using a GAN was to use the probe weights that the Discriminator learned to classify cancer vs. non cancer. It seems to me that the weights would learn real vs. fake and not necessarily have any direct bearing on cancer vs. non cancer. Makes me wonder why a GAN is used here at all - they do try to compare with SVM and logistic regression but results are not very clear.

Biological Relevance

Other comments

agitter commented 7 years ago

@gwaygenomics Do you think we should include this in the review, and if so, does it belong in Study (gene expression sub-section) or Categorize?

gwaybio commented 7 years ago

Agree with categorize label. Not sure about inclusion in review. Generative model plus categorize is really interesting and I think could have a big impact but this article doesn't really use the generator to its potential. I guess I'd say let's see how the categorize section comes together

agitter commented 7 years ago

We also have the potential to stick something like that in the Discussion. I have been thinking about network architectures that are hot in other domains but not yet widely applied in biomedicine. Those may be areas of opportunity in the short term, and GANs could fit in that category.

cgreene commented 7 years ago

I'm going to drop this for now from consideration for the categorize section. I think that, given the issues of the paper, it's not quite ready to help us determine whether or not this pushes the field forward.